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INTRODUCTION AND OVERVIE 


Am29200 AND Am29205 RISC MICROCONTROLLERS 


The Am29200 and Am29205 microcontrollers are part of a growing family of 32-bit 
reduced-instruction set (RISC) processors employing submicron circuits to increase the 
degree of system integration, yielding very low system cost. Dense circuitry and a large 
number of on-chip peripherals minimize the number of components required to imple- 

. ment embedded systems, while providing performance superior to that of complex- 
instruction-set (CISC) microprocessors. Systems implemented with the Am29200 or 
Am29205 microcontroller can achieve higher performance at lower cost than existing 
systems. The Am29200 and Am29205 microcontrollers are binary compatible with all 
other members of the 29K Family, further broadening the price/performance range of the 
29K Family. 


The Am29200 and Am29205 microcontrollers were designed expressly to meet the 
requirements of embedded applications such as laser printers, graphics processing, 
application program interface (API) accelerators, X terminals and servers, and scanners. 
‘Such applications make the following demands on system design: 


m Performance at low cost: A processor must interface with memory and peripherals 
with a minimum number of external components. 


m Design flexibility: One basic design must be extensible to an entire product line. 


m Reduced time-to-market: A complete set of development, debug, and benchmarking 
tools is critical for reducing product development time. 


m A rational, easy upgrade path: The processor family must provide bus- and software- 
compatibility so processor upgrades are transparent to both hardware and software. 


The Am29200 and Am29205 microcontrollers are optimized for any embedded applica- 
tion requiring better-than-CISC performance at minimal system cost. The electronic 
components for many systems, such as personal laser printers, amount to little more 
than the Am29200 or Am29205 microcontroller, ROM, DRAM, and electrical buffering. 


DESIGN PHILOSOPHY 


The 29K Family of processors results from a design philosophy that considers processor 
performance in light of the processor’s hardware and software environment. The key to 
maximizing performance is understanding that the processor is part of an integrated 
system, and is itself a collection of components that must be properly integrated. 


Processor features must be considered not only on their own merits, but also in relation 
to other components of the system. A particular feature that, while considered alone may 
increase one aspect of processor performance, may actually decrease the performance 
of the total system, because of the burden it places elsewhere in the system. 
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As an illustration, consider the factors involved in the execution time of any processor 
task: 


Task Time = (Instructions / Task) « (Cycles / Instruction) « (Time / Cycle) 


To minimize the time taken, it is necessary to minimize the above product. This is not 
equivalent to minimizing all the terms that contribute to the product; in fact, this is 
generally not possible due to the interaction of the terms. 


As an example of the interaction of the previous terms, consider the number of instruc- 
tions required for a task. An attempt to minimize this number, a more or less traditional 
approach to processor architecture design, increases the average number of cycles 
required for the execution of an instruction, because of the increased number of 
operations performed by each instruction. In addition, cycle time is increased because of 
instruction-decode time. 7 


A second example of the interaction in the previous equation appears in an attempt to 
reduce the cycle time through the pipelining of operations. In theory, the cycle time can 
be made arbitrarily small by the definition of an arbitrarily large number of pipeline 
stages. In practice, at least in the case of general-purpose processors, pipelining rarely 
yields much of its potential benefit. This is due to situations where the pipeline cannot be 
kept fully occupied, such as when memory references and branches occur. In these 
situations, additional pipeline stages increase the number of cycles required for an 
operation, and thus affect the Cycles / Instruction term. 


Optimum Performance 


Each of the terms in the previous equation has some minimum bound for a given 
implementation technology and task. In general, this minimum bound cannot be © 
approached without an offsetting increase in the other terms, making the overall product 
less-than-optimum. The question then arises, what combination of terms will yield an 
optimum product? There are several things to note when answering this question. 


The first observation is that the number of operations underlying a given task is more or 
less fixed. Any single processor ultimately limits the time required for a task because it 
has a single execution unit and a single instruction stream. The operations that must be 
performed are reflected in the Instructions / Task and Cycles / Instruction terms. These 
operations may be performed by relatively few instructions, where each instruction takes 
multiple cycles to execute, or by a larger number of instructions, where each takes a 
single cycle to execute. In the first case, the instructions are complex; in the second, 
they are simple. 


The point is that the trade-off between simple and complex instructions is not one-to- 
one. For example, reducing the number of cycles per instruction by a factor of three 
does not increase the number of instructions per task by the same factor. There are two 
reasons for this. The first is that even when an instruction set supports complex opera- 
tions, a large proportion of the instructions that are executed perform operations that 
could be performed as well by simple instructions. The second is that simple instructions 
expose more of the internal processor operation to an optimizing compiler. This allows 
the compiler to tailor the organization and sequence of operations to the task at hand, 
thereby reducing the total number of instructions executed. 
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Performance Leverage 


Another important observation is that there is a tremendous amount of leverage in the 
Time / Cycle and Cycles / Instruction terms. As they are made smaller, they have a 
proportionally greater effect on performance. 


For example, a reduction of 10 ns in the cycle time of a processor operating with a 
200-ns cycle time yields an increase of 5% in the processor’s performance. The same 
improvement in a processor operating with a 50-ns cycle time yields a 20% increase in 
performance. 


Correspondingly, a reduction of 0.2 in the number of cycles per instruction in a processor 
averaging 5 cycles per instruction yields a 4% increase in performance. However, the 
same reduction yields a 12.5% performance increase in a processor that averages 1.6 
cycles per instruction. 


Conclusion 


It is possible, and desirable, to increase the number of instructions executed for a given 
task, and more than make up for the performance impact of this increase by reductions 
in the cycle time and in the number of cycles per instruction. For example, if both the 
cycle time and the number of cycles per instruction are reduced by a factor of three, 

_ while the number of instructions for a given task is allowed to grow by 50%, the resulting 
task time is reduced by a factor of six. 


The Am29200 and Am29205 microcontrollers were designed with the above effects in 
mind. Maximum performance is obtained by the optimization of the product of the 
number of instructions per task, the number of cycles per instruction, and the cycle time, 
not by minimizing one factor at the expense of the others. This is accomplished by 
careful definition of all processor components. In particular: 


m= The /nstruction / Task term is optimized by the definition of simple instructions. The 
processor provides an efficient instruction set and a large number of general-pur- 
pose registers to an optimizing, high-level language compiler. Most reductions in 
this term are accomplished by the compiler. The number of instructions for a given 
task may be greater than the number of instructions for processors with complex 
instruction sets. However, this increase is more than offset by other improvements in 
processor performance. 


mg The Cycles / Instruction term is optimized by the data-flow structure and perfor- 
mance-enhancing features of the processor. A large amount of processor hardware is 
dedicated to achieving an average instruction- execution rate that is close to single- 
cycle execution. 


u The Time /Cycle term is optimized by the implementation technology, the processor 
system interface, and judicious use of pipelining. The simplicity of the instruction set 
and processor features helps minimize the cycle time. 
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PURPOSE OF THIS MANUAL 


This manual describes the technical features, programming interface, on-chip peripher- 
als, and complete instruction set of the Am29200 and Am29205 microcontrollers. 


INTENDED AUDIENCE 


_ This manual is intended for system hardware and software architects and system 
engineers who are designing or are considering designing systems based on the 
Am29200 and Am29205 microcontrollers. 


USER’S MANUAL OVERVIEW 


‘This manual contains information on the Am29200 and Am29205 microcontrollers and is 
essential for system hardware and software architects and design engineers. Additional 
information is available in the form of data sheets, application notes, and other docu- 
mentation provided with software products and hardware-development tools. 


The information in this manual is organized into eighteen chapters: 


= Chapter 1 introduces the features and performance aspects of the Am29200 and 
Am29205 microcontrollers. 


m= Chapter 2 describes the programmer’s model of the Am29200 and Am29205 
microcontrollers, including the instruction set and register model. 


m= Chapter 3 expands on the programmer’s model, discussing different data formats 
and data handling. Instructions that manipulate external data are also discussed. 


m= Chapter 4 details the management of the run-time stack and defines the conven- 
tions that apply to procedure linkage and register usage. 


m Chapter 5 describes the internal pipelining and the effects of the pipeline on program 
behavior. 


m Chapter 6 describes the system-protection features provided by the Am29200 and 
Am29205 microcontrollers. 


m Chapter 7 provides an overview of the processor’s system interfaces and the 
system components that are integrated on-chip. 


= Chapter 8 describes the ROM interface. 
Chapter 9 describes the DRAM interface. 


Chapter 10 describes the peripheral interface adapter, which is used for glueless 
attachment of a number of peripheral components. 


Chapter 11 describes the DMA controller. 
Chapter 12 describes the programmable I/O port. 
Chapter 13 describes the parallel port. 

Chapter 14 describes the serial port. 

Chapter 15 describes the video interface. 


Chapter 16 provides a description of the interrupt and trap mechanism and the han- 
dling of interrupts and traps, including the operation of the on-chip interrupt controller. 


Chapter 17 describes the software and hardware facilities for debugging and testing. 
m Chapter 18 provides a detailed description of the instruction set. 
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For those readers desiring only a brief overview of the Am29200 and Am29205 
microcontrollers, Chapter 1 identifies the outstanding features of each device. This 
chapter addresses the basic software and hardware concerns. 


Chapters 2, 3, and 5 are recommended reading for both hardware and software 
developers. 


For sofiware architects and system programmers interested mainly in software-related 
issues, Chapters 4, 6, and 16 provide the necessary information. Chapters 17 and 18 
also provide related information. | 


For hardware architects and systems hardware designers interested mainly in hard- 
ware-related issues, Chapters 7 through 15 and Chapter 17 provide most of the 
required information. Chapters 5 and 18 also provide related information. 


For users already familiar with other 29K Family processors, Chapters 7-15 describe the 
on-chip peripherals and system functions unique to the Am29200 and Am29205 
microcontrollers. 


AMD DOCUMENTATION 


29K Family 
ORDER NO. DOCUMENT 


10620 Am29000® and Am29005™ Microprocessors User’s Manual 
and Data Sheet 
Describes the Am29000 and Am29005 microprocessors’ technical 
features, programming interface, and complete instruction set. 


15723 ~ Am29030™ and Am29035™ Microprocessors User’s Manual 
and DataSheet _ | 
Describes the Am29030 and Am29035 microprocessors’ technical 
features, programming interface, and complete instruction set. 


14779 Am29050™ Microprocessor User’s Manual 
Describes the Am29050 microprocessor’s technical features, 
programming interface, and complete instruction set. 


16361 Am29200 and Am29205 RISC Microcontrollers Data Sheet 
Describes the Am29200 and Am29205 microcontrollers’ technical 
features, including electrical and mechanical specifications. 


17741 Am29240™, Am29245™, and Am29243™ RISC Microcontrollers 
User’s Manual and Data Sheet 
Describes the Am29240, Am29245, and Am29243 microcontrollers’ 
technical features, programming interface, and complete instruction set. 


17882 Am29240, Am29245, and Am29243 RISC Microcontrollers Brochure 
Describes features, reference designs, and tool support for this series of 
high-performance RISC microcontrollers. 


18002 29K Family Comparison Chart 
Compares the features of all 29K microprocessors and microcontrollers in 
a single chart organized for easy reference. 


11426 Fusion29KSs™ Catalog 
, Provides information on more than 200 tools that speed a 29K Family 
embedded product to market. Includes products from over 100 expert. 
suppliers of embedded development solutions. Design solution chapters 
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XX 


12990 


15176 


10344 


16693 


include: laser printer and OCR solutions, graphics solutions, and 
networking solutions. 


Fusion Newsletter 

Contains quarterly updates on developments in the Am186 Family, 29K 
Family, and E series of microprocessors and features new Fusion Partner 
solutions. 


29K Laser Printer Solutions Brochure 

Reviews how the 29K Family of microprocessors fits into the laser printer 
marketplace. Includes a description of AMD’s PCL and PostScript® 
Laser29K™ Low-Cost Raster Image Processor demonstration boards. 


29K Family Design-Made-Easy Solutions Brochure 
Presents an overview of the entire 29K Family of microprocessors and — 
microcontrollers. Features development support products. 


_ RISC Design-Made-Easy™ Applications Guide 


Presents topics on the 29K Family, including interfaces to integer 
multipliers, context switching, TLB handlers, benchmarking applications, 
byte-writable memories for three-bus microprocessors, host interface 
(HIF) version 2.0 specification, using the Am29000 microprocessor as a 
high-performance DMA controller, and writing interrupt handlers. 


Development Tools 


17704 


10287 
10626 
10957 


Am29200 and Am29205 RISC Microcontroller Brochure 

Reviews how the SA-29200 and SA-29205 demonstration boards and 
the SA-29200 expansion board use the Am29200 or Am29205 
microcontroller to meet requirements for low-cost embedded 
applications. Includes additional support product and ordering 
information. 


MiniMON29K™ Portable Debug Monitor Data Sheet 
XRAY29K™ Source-Level Debugger Data Sheet 
High C® 29K Development Toolkit Data Sheet 


To order literature, contact your local AMD sales office or call: 800-2929-AMD, ext. 3 (in 
the U.S.), or 800-531-5202, ext. 55651 (in Canada), or direct dial from any location: 


512-602-5651. 


RELATED PUBLICATIONS | 
The IEEE Standard 1149.1-1990 (JTAG) may be ordered from 


IEEE Computer Society Press 
Customer Service Center 
10662 Los Vaqueros Circle 
P.O. Box 3014 | 

Los Alamitos, CA 90720-1264 
USA | 


IEEE Catalog No. SH13144 


1-800-CS-BOOKS 
714-821-4010 (fax) 
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This chapter provides a general evaluation of the Am29200 and Am29205 microcontrollers. 
A detailed technical description of the Am29200 and Am29205 microcontrollers is 
contained in subsequent chapters. This chapter informally describes the features of the 
two microcontrollers, concentrating on features which distinguish them from other 
available processors and describing how these features enhance system penomance 
and cost-effectiveness. This chapter consists of the following sections: 


m Distinctive Characteristics 
m Key Features and Benefits 
m Performance Overview 
m™ Debugging and Testing 


1.1 DISTINCTIVE CHARACTERISTICS 


The Am29200 and Am29205 RISC microcontrollers are highly integrated, 32-bit 

~ embedded processors implemented in complementary metal-oxide semiconductor 
(CMOS) technology. Through submicron technology, the Am29200 and Am29205 
microcontrollers incorporate a complete set of system facilities commonly found in 
printing, imaging, graphics, and other embedded applications. The distinctive features of 
each microcontroller are compared in Table 1-1. 


Based on the 29K architecture, the Am29200 and Am29205 microcontrollers are part of 
a growing family of RISC microcontrollers, which also includes the high-performance 
Am29240, Am29245, and Am29243 RISC microcontrollers. 
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Table 1-1 


1.1.1 


Am29200 and Am29205 Microcontrollers: Feature Summary 


FEATURE Am29200 Microcontroller 


Am29205 Microcontroller 












Data Bus Width 
Internal 32 bits 32 bits 
External 32 bits 16 bits 





ROM Interface 



















Banks 4 3 
Width . 8, 16, 32 bits 8, 16 bits 
ROM Size (Max/Bank) » 16 Mbytes 4 Mbytes 
Boot-up ROM Width 8, 16, 32 bits 16 bits 





Burst-mode access Supported Not Supported 


DRAM Interface 
Banks 



































4 4 







Width 16, 32 bits 16 bits only 
Size: 32-bit mode 16 Mbytes/bank — 

Size: 16-bit mode 8 Mbytes/bank 8 Mbytes/bank 
Video DRAM Supported 


Not Supported 








On-Chip DMA 
Width (ext. peripherals) 8, 16, 32 bits 8, 16 bits 
Externally Controlled 2 Channels 1 Channel 
External Master Access Yes No 
External Terminate Signal Yes. No 

6 2 
‘8, 16, 32 bits 8, 16 bits 
Programmable I/O Port (PIO) 
Signals 

Serial Ports | 
Ports 1 Port 1 Port 
DSR/DTR Supported Uses PIO signals 


Interrupt Controller 
External Interrupt Pins 
External Trap and Warn Pins 


Parallel Port Controller 
32-bit Transfer Yes 
ee 


JTAG Debug Support 

















Peripheral Interface Adapter 
(PIA) 

PIA Ports 
Data Width 







Am29200 Microcontroller 


The Am29200 microcontroller meets the common requirements of embedded applica- 
tions such as industrial control, graphics processing, imaging applications, laser printers, 
and general purpose applications requiring high-performance in a compact design. 
Figure 1-1 shows a block diagram of the Am29200 microcontroller. The microcontroller 
includes the following features: 


= Completely integrated system for embedded applications 
m Full 32-bit architecture | 

m 16- and 20-MHz operating frequencies 

a 


Price/performance flexibility. Support for several low-cost memory configurations 
allows performance points of 8, 6, 5, and 3 million instructions per second sustained 
(at 16 MHz). | | 


m 4-Gbyte virtual address space, 304-Mbyte physical address space implemented 
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m 192 general-purpose registers; three-address instruction architecture 

m Glueless system interfaces with on-chip wait state control 

m Four banks of ROM, each separately programmable for 8-, 16-, or 32-bit interface 

m Four banks of DRAM, each separately programmable for 16- or 32-bit interface 

m= Burst-mode and page-mode access support | 

m= DRAM mapping on-chip 

m Advanced debugging support 

m IEEE Std. 1149.1-1990 (JTAG) compliant Standard Test Access Port and 
Boundary Scan Architecture 

= Two-channel DMA controller with queuing on one channel 

@ 6-port peripheral interface adapter 

m 16-line programmable I/O port 

= Bidirectional bit serializer/deserializer (video interface) 

m Serial port (UART) 

m= Bidirectional parallel port controller 

m Interrupt controller 

@ On-chip timer | 

@ Binary compatibility with all 29K Family microcontrollers and microprocessors 


Figure 1-1 Am29200 Microcontroller Block Diagram 


Parallel Port Clock/ 2 DREQ 
Sono as 6 5 Control Y 5 2 DACK 
Lines MEMOLK ‘TAG Lines GREQ/GACK/TDMA 


‘3 Parallel Port 2-Channel DMA 16 ° 
Controller Controller 
serial VO 
Data 
Programmable 
Serial Port VO Port 
Printer/Scanner 
Video a Am29000 CPU Interrupt Interrupts, Traps 
a Controller 
ROM 


Chip Selects 
| oo | DRAM Controller 


| oo | 
4 7 
DTA: Timer/Counter 


a a 
PIA Address Instruction/Data DRAM 
Chip Selects Bus Bus 
Peripherals 
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1.1.2 Am29205 Microcontroller 


The Am29205 RISC microcontroller (see Figure 1-2) is a highly integrated, iswcest 
derivative of the Am29200 microcontroller, with a 16-bit instruction bus, fewer peripheral 
ports, and no JTAG interface. It includes the following features: 


Completely integrated system for embedded applications 

Full 16-bit external, 32-bit internal architecture 

12- and 16-MHz operating frequencies 

68-Mbyte address space 

Two-channel DMA controller (one external) — 

Three separately programmable ROM banks with16-bit ROM interface 


Fully functional 16-bit DRAM interface eOMpIEIS with address MUXing, Refresh, and 
RAS/CAS generation 


Two-port peripheral interface adapter 

Eight-line programmable 1/O port 

Bidirectional bit serializer/deserializer (video interface) 
Serial port (UART) 

Bidirectional parallel! port controller 

Interrupt controller | 


On-chip timer 
Figure 1-2 Am29205 Microcontroller Block Diagram 
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KEY FEATURES AND BENEFITS 


Complete Set of Common System Peripherals 


The Am29200 and Am29205 microcontrollers minimize system cost by incorporating a 
complete set of system facilities commonly found in embedded applications, eliminating 
the cost of additional components. 


The on-chip functions include: a ROM controller, a DRAM controller, a peripheral 
interface adapter, a DMA controller, a programmable I/O port, a parallel port, a serial 
port, and an interrupt controller. A serializer/deserializer (video interface) is also included 
for printer, scanner, and other imaging applications. 


By providing a complete set of common system peripherals on-chip and glueless 
interfacing to external memories and peripherals, these RISC microcontrollers let 
product designers capitalize on the very iow system cost made possible by the integra- 
tion of processor and peripherals. Many simple systems can be built using only the 
Am29200 or Am29205 microcontroller and external ROM and/or DRAM memory. 


ROM Controller (Chapter 8) 


The ROM controller supports four individual banks of ROM or other static memory in the 
Am29200 microcontroller and three banks in the Am29205 microcontroller. Each ROM 
bank has its own timing characteristics, and each bank can be a different size: either 8, 
16, or 32 bits wide in the Am29200 microcontroller and 8 or 16 bits wide in the Am29205 
microcontroller. The ROM banks can appear as a contiguous memory area of up to 64 
Mbytes in size on the Am29200 microcontroller. The ROM controller also supports writes 
to the ROM memory space for devices such as flash EPROMs and SRAMs. 


DRAM Controller (Chapter 9) 


The DRAM controller supports four separate banks of dynamic memory. On the 
Am29200 microcontroller, each bank can be a different size: either 16 or 32 bits wide. 
DRAM banks on the Am29205 microcontroller are 16 bits wide. The DRAM banks can 
appear as a contiguous memory area of up to 64 Mbytes in size on the Am29200 
microcontroller and 32 Mbytes on the Am29205 microconiroller. To support system 
functions such as on-the-fly data compression and decompression, four 64-Kbyte 
regions of the DRAM can be mapped into a 16-Mbyte virtual address space. 


Peripheral Interface Adapter (PIA) (Chapter 10) 

The peripheral interface adapter allows for additional system features implemented by 
external peripheral chips. The PIA permits glueless interfacing from the Am29200 
microcontroller to as many as six external peripheral regions and from the Am29205 


microcontroller to two external peripherals. 


DMA Controller (Chapter 11) 


The DMA controller in the Am29200 microcontroller provides two channels for transfer- 
ring data between the DRAM and internal or external peripherals. One of the DMA 
channels is double buffered to relax the constraints on the reload time. On the Am29205 
microcontroller, internal 32-bit transfers are supported on two DMA channels; external 
transfers are limited to 8-or 16-bit data accesses.on one DMA channel. 


interrupt Controller (Section 16.9) 


The interrupt controller generates and reports the status of interrupts caused by on-chip 
peripherals. 
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1.2.2 


1.2.3 


1/O Port (Chapter 12) 


The Am29200 microcontroller’s I/O port permits direct access to 16 individually program- 
mable external input/output signals. Eight signals are available on the Am29205 
microcontroller. These eight signals can be comiguiee to cause interrupts on either 
microcontroller. 


Parallel Port (Chapter 13) 


The parallel port implements a bidirectional IBM PC-compatible parallel interface to a 
host processor. 


Serial Port (Chapter 14) 
The serial port implements a full-duplex UART. 


Serializer/Deserializer (Chapter 15) 


The bidirectional bit serializer/deserializer (video interface) permits direct connection to a 
number of laser marking engines, video olenray= or raster input devices such as 
scanners. : 


Wide Range of Price/Performance Points 


To reduce design costs and time-to-market, one basic system design can be used as the 
foundation for an entire product line. From this design, numerous implementations of the 
product at various levels of price and performance may be derived with minimum time, 
effort, and cost. 


The Am29200 and Am29205 microcontrollers provide this capability through program- 
mable memory widths, burst-mode and page-mode access support, programmable wait 
states, and hardware and 29K Family software compatibility. A system can be upgraded 
without hardware and software redesign using various memory architectures. 


The ROM controller on the Am29205 microcontroller accommodates memories that are 
either 8 or 16 bits wide, while that of the Am29200 microcontroller supports either 8-, — 
16-, or 32-bit memories. The DRAM controller on the Am29205 microcontroller accom- 
modates dynamic memories that are 16 bits wide; the Am29200 microcontroller supports 


either 16- or 32-bit memories. 


These unique features provide a flexible interface to low-cost memory as well as a 
convenient, flexible upgrade path. For example, a system can start with a 16-bit memory 
design and can subsequently improve performance by migrating to a 32-bit memory 
design. One particular advantage is the ability to add memory in half-megabyte incre- 
ments. This provides significant cost savings for applications that do not require larger 


-memory upgrades. - 


The Am29200 microcontroller family allows users to address a wide range of 
price/performance points, with higher performance and lower cost than existing designs 


_ based on CISC microprocessors. 


Glueless System Interfaces 


The Am29200 and Am29205 microcontrollers also minimize system cost by providing a 
glueless attachment to external ROMs, DRAMs, and other peripheral components. 
Processor outputs have edge-rate control that allows them to drive a wide range of load 


capacitances with low noise and ringing. This eliminates the cost of external logic and 


buffering. 
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Bus- and Binary-Compatibility | 

Compatibility within a processor family is critical for achieving a rational, easy upgrade 
path. The Am29200 and Am29205 microcontrollers are members of a bus-compatible 
family of RISC microcontrollers, which also includes the high-performance Am29240, 
Am29245, and Am29243 microcontrollers. Future members of this family will improve in 
price and performance and system capabilities without requiring that users redesign 
their system hardware or software. Bus compatibility ensures a convenient upgrade path 
for future systems. 


The Am29200 microcontroller is binary compatible with the Am29240, Am29245, and 
Am29243 microcontrollers, as well as the Am29000, Am29005, Am29030, Am29035, 
and Am29050 microprocessors. The Am29200 microcontroller family provides a 
migration path to low-cost, highly integrated systems for products based on other 29K 
Family microprocessors, without requiring expensive rewrites of application software. 


Complete Development and Support Environment 


A complete development and support environment is vital for reducing a product’s 
time-to-market. Advanced Micro Devices has created a standard development environ- - 
ment for the 29K Family of processors. In addition, the Fusion29K third-party support 
organization provides the most comprehensive customer/partner program in the 
embedded processor market. 


Advanced Micro Devices offers a complete set of hardware and software tools for 
design, integration, debugging, and benchmarking. These tools, which are available now 
for the 29K Family, include the following: 


mw High C® 29K optimizing C compiler with assembler, linker, ANSI library functions, 
and 29K architectural simulator , 


m XRAY29K™ source-level debugger 
@ MiniMON29K™ debug monitor 
m™ Acomplete family of demonstration and development boards 


In addition, Advanced Micro Devices has developed a standard host interface (HIF) 
specification for operating system services, the Universal Debug Interface (UDI) for 
seamless connection of debuggers to ICEs and target hardware, and extensions for the 
UNIX common object file format (COFF). 


This support is augmented by an engineering hotline, an on-line bulletin board, and field 
application engineers. 


PERFORMANCE OVERVIEW 


The Am29200 and Am29205 microcontrollers offer a significant margin of performance 
over CISC microprocessors in existing embedded designs, since the majority of 
processor features were defined for the maximum achievable performance at a very low 
cost. This section describes the features of the Am29200 and Am29205 microcontrollers 
from the point of view of system performance. 


Instruction Timing (Section 2.1) 


The Am29200 and Am29205 microcontrollers use an arithmetic/logic unit, a field shift _ 
unit, and a prioritizer to execute most instructions. Each of these is organized to operate 
on 32-bit operands and provide a 32-bit result. All operations are performed in a single 
cycle. 
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The performance degradation of load and store operations is minimized in the Am29200 

and Am29205 microcontrollers by overlapping them with instruction execution, by taking 

advantage of pipelining, and by organizing the flow of external data into the processor so 
that the impact of external accesses is minimized. - 


1.3.2 Pipelining (Chapter 5) 


Instruction operations are overlapped with instruction fetch, instruction decode and 
operand fetch, instruction execution, and result write-back to the register file. Pipeline 
forwarding logic detects pipeline dependencies and routes data as required, avoiding 
delays that might arise from these dependencies. 


Pipeline interlocks are implemented by processor hardware. Except for a few special 
cases, it is not necessary to rearrange programs to avoid pipeline dependencies, 
although this is sometimes desirable for performance. 


1.3.3 Burst-Mode and Page-Mode Memories (Sections 8.3.5, 9.3.7) 


The Am29200 microcontroller directly supports burst-mode memories in ROM address 
space. The burst-mode memory supplies instructions at the maximum bandwidth, 
without the complexity of an external cache or the performance degradation due to 
cache misses. 


Both the Am29200 and Am29205 microcontrollers can also use the page-mode capabili- 
ty of common DRAMs to improve the access time in cases where page-mode accesses 
can be used. This is particularly useful in very low-cost systems with 16-bit-wide 
DRAMs, where the DRAM must be accessed twice for each 32-bit operand. _ 


1.3.4 Instruction Set Overview (Section 2.1, Chapter 18) 


The Am29200 and Am29205 microcontrollers employ a three-address instruction set 
architecture. The compiler or assembly-language programmer is given complete 
freedom to allocate register usage. There are 192 general-purpose registers, allowing 
the retention of intermediate calculations and avoiding needless data destruction. 
Instruction operands may be contained in any of the general-purpose registers, and the 
results may be stored into any of the general-purpose registers. | 


The instruction set contains 117 instructions that are divided into nine classes. These 
classes are integer arithmetic, compare, logical, shift, data movement, constant, floating 
point, branch, and miscellaneous. The floating-point instructions are not executed 
directly, but are emulated by trap handlers. 


All directly implemented instructions are capable of executing in one processor cycle, 
with the exception of interrupt returns, loads, and stores. 


1.3.5 Data Formats (Chapter 3) 


The Am29200 and Am29205 microcontrollers define a word as 32 bits of data, a 
half-word as 16 bits, and a byte as 8 bits. The hardware provides direct support for 
word-integer (signed and unsigned), word-logical, word-boolean, half-word integer 
(signed and unsigned), and character data (signed and unsigned). © 


Word-boolean data is based on the value contained in the most significant bit of the 
word. The values TRUE and FALSE are represented by the MSB values 1 and 0, 
respectively. | 


Other data formats, such as character strings, are supported by instruction sequences. 
Floating-point formats (single and double precision) are defined for the processor; 
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however, there is no direct hardware support for these formats in the Am29200 or 
Am29205 microcontroller. 


Protection (Chapter 6) | 


The Am29200 and Am29205 microcontrollers offer two mutually exclusive modes of 
execution, the user and supervisor modes, that restrict or permit accesses to certain 
processor registers and external storage locations. 


The register file may be configured to restrict accesses to supervisor-mode programs on 
a bank-by-bank basis. 


DRAM Mapping (Section 9.3.5) 


The Am29200 and Am29205 microcontrollers provide a 16-Mbyte region of virtual 
memory that is mapped to one of four 64-Kbyte blocks in the physical DRAM memory. 
This supports system functions such as on-the-fly data compression and decompres- 
sion, allowing a large data structure such as a frame buffer to be stored in a compressed 
format while the application software operates on a region of the structure that is 
decompressed. Using a mechanism that is analogous to demand paging, system 
software moves data between the compressed and decompressed formats in a way that 
is invisible to the applications software. This feature can greatly reduce the amount of 
memory required for printing, imaging, and graphics applications. 


interrupts and Traps (Chapter 16) 


_ When the microcontroller takes an interrupt or trap, it does not automatically save its 


current state information in memory. This lightweight interrupt and trap facility greatly 
improves the performance of temporary interruptions such as simple operating-system 
calls that require no saving of state information. 


In cases where the processor state must be saved, the saving and restoring of state 
information is under the control of software. The methods and data structures used to 
handle interrupts—and the amount of state saved—may be tailored to the needs of a 
particular system. 


Interrupts and traps are dispatched through a 256-entry vector table that directs the 
processor to a routine that handles a given interrupt or trap. The vector table may be 
relocated in memory by the modification of a processor register. There may be multiple 
vector tables in the system, though only one is active at any given time. 


The vector table is a table of pointers to the interrupt and trap handlers and requires only 
1 Kbyte of memory. The processor performs a vector fetch every time an interrupt or trap 
is taken. The vector fetch requires at least three cycles, in addition to the number of 
cycles required for the basic memory access. 


DEBUGGING AND TESTING (Chapter 17) 


Software debugging on the Am29200 and Am29205 microcontrollers is facilitated by the 
instruction trace facility and instruction breakpoints. Instruction tracing is accomplished 
by forcing the processor to trap after each instruction has been executed. Instruction 
breakpoints are implemented by the HALT instruction or by a software trap. 


The Am29200 microcontroller provides two additional features to assist system debug- 
ging and testing: | 


m The test/development interface is composed of a group of pins that indicate the state 
of the processor and control the operation of the processor. 
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m An IEEE Standard 1149.1-1990 (JTAG) compliant Standard Test Access Port and 
Boundary-Scan Architecture. The test access port provides a scan interface for test- 
ing system hardware in a production environment, and contains extensions that allow 
a hardware-development system to control and observe the processor without inter- 
posing hardware between the processor and system. 


Hardware testing and debugging on the Am29205 microcontroller are supported by 
using an Am29200 microcontroller to emulate an Am29205 microcontroller. 
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Pa PROGRAMMING 


This chapter focuses on programming the Am29200 and Am29205 microcontrollers. First, 
this chapter presents an instruction set overview. It then describes the register model, 
emphasizing the general- and special-purpose registers. This chapter also describes" 
certain special-purpose registers that deal directly with instruction execution. Finally, this 
chapter describes general considerations related to application programming. 


2.1 INSTRUCTION SET 


The Am29200 and Am29205 microcontrollers recognize 117 instructions. All instructions 
execute in a single cycle, except for IRET, IRETINV, LOADM, STOREM, and certain 
arithmetic instructions such as floating-point instructions. Floating-point and integer 
multiply and divide instructions are not implemented directly in hardware, but are imple- 
mented by a virtual arithmetic interface invoked using instruction traps (see Section 2.8). | 


Most instructions deal with general-purpose registers for operands and results; however, 
in most instructions, an 8-bit constant can be used in place of a register-based operand. 
Some instructions deal with special-purpose registers and external devices and memories. . 


This section describes the nine instruction classes and provides a brief summary of 
instruction operations. A detailed instruction specification is contained in Chapter 18. 
Section 18.1 describes the nomenclature used here. | 


If the processor attempts to execute an unimplemented instruction, an Illegal Opcode 
trap occurs unless the instruction is reserved for emulation (see Section 2.1.10). 
Reserved instructions are assigned individual traps. 


2.1.1 Integer Arithmetic 


The Integer Arithmetic instructions perform add, subtract, multiply, and divide operations 
on word-length integers. Certain instructions in this class cause traps if signed or 
unsigned overflow occurs during the execution of the instruction. There is support for. 
multiprecision arithmetic on operands whose lengths are multiples of words. All instruc- 
tions in this class set the ALU Status Register. The Integer Arithmetic instructions are 
shown in Table 2-1. In the Am29200 and Am29205 microcontrollers, the integer multiply 
and divide instructions cause traps to routines which perform the operations. 


2.1.2 Compare 


The Compare instructions (Table 2-2) test for various relationships between two values. 
For all Compare instructions except the CPBYTE instruction, the comparisons are 
performed on word-length signed or unsigned integers. There are two types of Compare 
instructions. The first type places a Boolean value reflecting the outcome of the compare 
into a general-purpose register. For the second type, assert instructions, instruction 
execution continues only if the comparison is true; otherwise a trap occurs. The assert 
instructions specify a vector for the trap (see Section 16.3). 


The assert instructions support run-time operand checking and operating-system calls. If 
the trap occurs in the User mode, and a trap number between 0 and 63 is specified by 
the instruction, a Protection Violation trap occurs. 
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Integer Arithmetic Instructions 


Mnemonic 


ADD 
ADDS 


ADDU 


ADDC 
ADDCS 


. ADDCU 


SUB 
SUBS 


SUBU 


SUBC 


~ SUBCS 


SUBCU 


SUBR 
SUBRS 


SUBRU 


SUBRC 
SUBRCS 


SUBRCU 


MULTIPLU 
MULTIPLY 
MUL 
MULL 
MULTM 
MULTMU 
MULU 
DIVIDE 


DIVIDU 


DIVO - 
DIV 
DIVL 
DIVREM 


Operation Description 


DEST <— SRCA + SRCB 


DEST — SRCA + SRCB 
IF signed overflow THEN Trap (Out of Range) 


DEST « SRCA + SRCB 
IF unsigned overflow THEN Trap (Out of Range) 


DEST <— SRCA + SRCB + © 


DEST — SRCA + SRCB + C 
IF signed overflow THEN Trap (Out of Range) 


DEST < SRCA + SRCB+C 
IF unsigned overflow THEN Trap (Out of Range) 


DEST «+ SRCA - SRCB 


DEST < SRCA —-SRCB | 
IF signed overflow THEN Trap (Out of Range) 


DEST «+ SRCA—-SRCB 
IF unsigned underflow THEN Trap (Out of Range) 


DEST <— SRCA-SRCB-1+C 


DEST «+ SRCA-SRCB-1+C 
IF signed overflow THEN Trap (Out of Range) 


DEST « SRCA—SRCB-1+C 
IF unsigned underflow THEN Trap (Out of Range) 


DEST <- SRCB -SRCA 


DEST «+ SRCB-SRCA_.. 
IF signed overflow THEN Trap (Out of Range) 


DEST «+ SRCB—-SRCA 
IF unsigned underflow THEN Trap (Out of Range) 


DEST <— SRCB -—-SRCA-1+C 


DEST — SRCB —-SRCA-1+C 
IF signed overflow THEN Trap (Out of Range) 


DEST «— SRCB —- SRCA— 1 +C 
IF unsigned underflow THEN Trap (Out of Range) 


DEST <— SRCA - SRCB (unsigned) 

DEST < SRCA - SRCB (signed) 

Perform one-bit step of a multiply operation (signed) 
Complete a sequence of multiply steps 

DEST < SRCA - SRCB (signed), most significant bits 
DEST < SRCA - SRCB (unsigned), most significant bits 
Perform one-bit step of a multiply operation (unsigned) 


DEST < (Q//SRCA)/SRCB (signed) 
Q — Remainder 


DEST < (Q//SRCA)/SRCB (unsigned) 
Q <— Remainder 


Initialize for a sequence of divide steps (unsigned) 
Perform one-bit step of a divide operation (unsigned) 
Complete a sequence of divide steps (unsigned) 
Generate remainder for divide operation (unsigned) 
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Compare Instructions 


Mnemonic 


CPEQ 
CPNEQ 
CPLT 
CPLTU 
CPLE 
CPLEU 
CPGT 
CPGTU 
CPGE 
CPGEU 


CPBYTE 


ASEQ 
ASNEQ 
ASLT 
ASLTU 
ASLE 
ASLEU 
ASGT 
ASGTU 
ASGE 


ASGEU 


Operation Description 


IF SRCA = SRCB THEN DEST < TRUE 
ELSE DEST «+ FALSE 


IF SRCA <> SRCB THEN DEST < TRUE 
ELSE DEST <« FALSE 


IF SRCA < SRCB THEN DEST < TRUE 
ELSE DEST <— FALSE 


IF SRCA < SRCB (unsigned) THEN DEST <— TRUE 
ELSE DEST < FALSE 


IF SRCA s SRCB THEN DEST ¢ TRUE 
ELSE DEST <— FALSE 


IF SRCA < SRCB (unsigned) THEN DEST — TRUE 
ELSE DEST — FALSE 


IF SRCA > SRCB THEN DEST < TRUE 
ELSE DEST < FALSE 


IF SRCA > SRCB (unsigned) THEN DEST <- TRUE 
ELSE DEST < FALSE 


If SRCA = SRCB THEN DEST < TRUE 
ELSE DEST < FALSE 


IF SRCA = SRCB (unsigned) THEN DEST <— TRUE 
ELSE DEST < FALSE 


IF (SRCA.BYTEO = SRCB.BYTEO) OR 

(SRCA.BYTE1 = SRCB.BYTE1) OR 

(SRCA.BYTE2 = SRCB.BYTE2) OR 

(SRCA.BYTE3 = SRCB.BYTE3) THEN DEST «< TRUE 
ELSE DEST < FALSE 


~1F SRCA = SRCB THEN Continue 


ELSE Trap (VN) 


IF SRCA <> SRCB THEN Continue 
ELSE Trap (VN) 


IF SRCA < SRCB THEN Continue 
ELSE Trap (VN) 


IF SRCA < SRCB (unsigned) THEN Continue 
ELSE Trap (VN) 


IF SRCA <SRCB THEN Continue 
ELSE Trap (VN) 


IF SRCA < SRCB (unsigned) THEN Continue 
ELSE Trap (VN) 


IF SRCA > SRCB THEN Continue 
ELSE Trap (VN) 


IF SRCA > SRCB een THEN Continue 
ELSE Trap (VN) 


IF SRCA = SRCB THEN Continue 
ELSE Trap (VN) 


IF SRCA > SRCB (unsigned) THEN Continue 


ELSE Trap (VN) 
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Table 2-3 


2.1.4 


Table 2-4 


2.1.5 


2-4 


Logical 


The Logical instructions (Table 2-3) perform a set of bit-by-bit Boolean functions on 
word-length bit strings. All instructions in this class set the ALU Status Register. 


Logical Instructions 


Mnemonic Operation Description 
AND DEST — SRCA & SRCB 
ANDN DEST — SRCA & ~ SRCB 
NAND DEST <~ (SRCA & SRCB) 
OR DEST — SRCA|SRCB 
NOR DEST < ~ (SRCAI SRCB) 
XOR DEST — SRCA4 SRCB 
XNOR DEST < ~ (SRCA“ SRCB) 
Shift 


The Shift instructions (Table 2-4) perform arithmetic and logical shifts. All but the 
EXTRACT instruction operate on word-length data and produce a word-length result. 
The EXTRACT instruction operates on double-word data and produces a word-length 
result. If both parts of the double-word for the EXTRACT instruction are from the same 
source, the EXTRACT operation is equivalent to a rotate operation. For each operation, 
the shift count is a 5-bit integer, specifying a shift amount in the range of 0 to 31 bits. 


Shift Instructions 


Mnemonic Operation Description 

SLL DEST <— SRCA << SRCB (zero fill) 

SRL DEST <— SRCA >> SRCB (zero fill) 

SRA DEST <— SRCA >> SRCB (sign fill) 

EXTRACT DEST < high-order word of (SRCA//SRCB << FC) 


Data Movement 


The Data Movement instructions (Table 2-5) move bytes, half-words, and words 
between processor registers. In addition, they move data between general-purpose 
registers and external devices, and memories. The instructions LOADL and STOREL 
are provided for compatibility with other 29K processors and are treated as LOAD and 
STORE instructions. Similarly, the instructions MFTLB and MTTLB perform no operation, 
except that both are privileged instructions. 
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Data Movement Instructions 


Mnemonic 


LOAD 
LOADL 


LOADSET 


LOADM 


STORE 


STOREL 


STOREM 


EXBYTE 


EXHW 


EXHWS 
INBYTE 


INHW 


MFSR 
MFTLB 
MTSR 
MTSRIM 
MTTLB 


Constant 


Operation Description 


DEST « EXTERNAL WORD [SRCB] 
Implemented as LOAD 


DEST <— EXTERNAL WORD [SRCB] 
EXTERNAL WORD [SRCB] < h‘FFFFFFFF’ 


DEST.. DEST + COUNT — 
EXTERNAL WORD [SRCB] .. 
EXTERNAL WORD [SRCB + COUNT - 4] 


EXTERNAL WORD [SRCB] <— SRCA 
Implemented as STORE 


EXTERNAL WORD [SRCB] .. 
EXTERNAL WORD [SRCB + COUNT - 4] — 
SRCA .. SRCA + COUNT 


DEST < SRCB, with low-order byte replaced by byte in SRCA 
selected by BP 


DEST < SRCB, with low-order half-word replaced by half-word in SRCA 
selected by BP : 


DEST < half-word in SRCA selected by BP, sign-extended to 32 bits 


DEST « SRCA, with byte selected by BP replaced by low-order byte 
of SRCB 


DEST « SRCA, with half-word selected by BP replaced by low-order 
half-word of SRCB 


DEST < SPECIAL 

no operation (privileged) 
SPDEST — SRCB 
SPDEST <- 0116 


no operation (privileged) 


The Constant instructions (Table 2-6) provide the ability to place half-word and word 
constants into registers. Most instructions in the instruction set allow an 8-bit constant as 
an operand. The Constant instructions allow the construction of larger constants. 


Constant Instructions 


Mnemonic 


CONST 
CONSTH 
CONSTN 


Operation Description 


DEST <- 0116 | 
Replace high-order half-word of SRCA by 116 
DEST < 1116 
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2.1.7 Floating Point 
The Floating-Point instructions (Table 2-7) provide operations on single-precision (32-bit) 
or double-precision (64-bit) floating-point data. They also provide conversions between 
single-precision, double-precision, and integer number representations. In the Am29200 
and Am29205 microcontrollers, these instructions cause traps (specified by the vector 
numbers listed in the table) to routines which perform the floating-point operations. 


Table 2-7 Floating-Point Instructions 


Mnemonic Operation Description Vector Number 


FADD DEST (single-precision) «<- SRCA (single-precision) 48 
+ SRCB (single-precision) 

DADD _ DEST (double-precision) |< SRCA (double-precision) — 49 
+ SRCB (double-precision) 

FSUB DEST (single-precision) <- SRCA (double-precision) 50 
— SRCB (single-precision) 

DSUB } DEST (double-precision) |< SRCA (double-precision) | 51 
— SRCB (double-precision) 

FMUL DEST (single-precision) <- SRCA (single-precision) 52 

| - SRCB (single-precision) 

FDMUL DEST (double-precision) |< SRCA (single-precision) 57 
- SRCB (single-precision) 

DMUL DEST (double-precision) |< SRCA (double-precision) 53 

| - SRCB (double-precision) 

FDIV DEST (single-precision) < SRCA (single-precision 54 
/ SRCB (single-precision) 

DDIV DEST (double-precision) |< SRCA (double-precision) 55 
/ SRCB (double-precision) | 

FEQ IF SRCA (single-precision) = SRCB (single-precision) 42 

THEN DEST <— TRUE 
ELSE DEST < FALSE 
DEQ IF SRCA (double-precision) = SRCB (double-precision) 43 
THEN DEST <— TRUE 
ELSE DEST < FALSE 
FGE IF SRCA (single-precision) >= SRCB (single-precision) 46 


THEN DEST < TRUE 
ELSE DEST < FALSE 


DGE IF SRCA (double-precision) >= SRCB (double-precision) 47 
THEN DEST < TRUE 
ELSE DEST < FALSE 


FGT IF SRCA (single-precision) > SRCB (single-precision) | 44 
THEN DEST <- TRUE 
ELSE DEST<- FALSE 
DGT IF SRCA (double-precision) > SRCB (double-precision) 45 


THEN DEST <— TRUE 
ELSE DEST < FALSE 


SQRT DEST (single-precision, double-precision) 37 
<- SQRT [SRCA (single-precision, double-precision)] 

CONVERT DEST (integer, single-precision, double-precision) 36 
< SRCA (integer, single-precision, double-precision) 

CLASS DEST < CLASS [SRCA (single-precision, double-precision)] 38 
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Branch 


The Branch instructions (Table 2-8) control the execution flow of instructions. Branch 
target addresses may be absolute, relative to the program counter (with the offset given 
by a signed instruction constant), or contained in a general-purpose register. For 
conditional jumps, the outcome of the jump is based on a Boolean value in a general- 
purpose register. Procedure calls are unconditional and save the return address ina 
general-purpose register. All branches have a delayed effect; the instruction following 


_ the branch is executed regardless of the outcome of the branch. 


Branch Instructions 


Mnemonic Operation Description 


CALL DEST <— PC//00 +8 
PC — TARGET 
Execute delay instruction 


CALL! DEST < PC//00 +8 
PC <— SRCB 
Execute delay instruction 


JMP PC <— TARGET 
Execute delay instruction 


JMPI PC — SRCB 
Execute delay instruction 


JMPT IF SRCA = TRUE THEN PC <— TARGET 
Execute delay instruction 


JMPTI IF SRCA = TRUE THEN PC <— SRCB 
Execute delay instruction 


JMPF- IF SRCA = FALSE THEN PC < TARGET 
Execute delay instruction 


JMPFI IF SRCA = FALSE THEN PC < SRCB 
Execute delay instruction 


JMPFDEC IF SRCA = FALSE THEN 
SRCA <— SRCA — 1 
PC <— TARGET 
ELSE 
SRCA <— SRCA — 1 
Execute delay instruction 


Miscellaneous 


The Miscellaneous instructions (Table 2-9) perform various operations that cannot be 
grouped into other instruction classes. In certain cases, these are control functions 
available only to Supervisor-mode programs. The instructions INV and IRETINV are 
provided for compatibility with other 29K processors. INV performs no operation, and 
IRETINV performs the same operations as IRET. Both are privileged instructions. 
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Table 2-10 
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Miscellaneous Instructions 


Mnemonic Operation Description 

CLZ Determine number of leading zeros in a word 

SETIP Set IPA, IPB, and IPC with operand register numbers 
EMULATE Load IPA and IPB with operand register numbers and Trap (VN) 
INV No operation 

IRET | Perform an interrupt return sequence 

IRETINV Perform an interrupt return sequence 

HALT Enter Halt mode 


Reserved Instructions 
Sixteen operation codes are reserved for instruction emulation. Each of these instruc- 


tions causes a trap and sets the indirect pointers IPC, IPA, and IPB. The relevant — 


operation codes, and the corresponding trap vectors, are as follows: 


Reserved Instructions 


Operation Codes (Hexadecimal) Trap Vector Numbers (Decimal) 


D8—DD — 24-29 
E7—-E9 39-41 
F8 56 

FA-FF 58-63 


The reserved instructions are intended for future processor enhancements, and users 
desiring compatibility with future processor versions should not use them for any 
purpose. 


REGISTER MODEL 


The microcontroller has two classes of registers that are accessible by instructions. 
These are the general-purpose registers and the special-purpose registers. Any 
operation available to the microcontroller can be performed on the general-purpose 
registers, while special-purpose registers are accessed only by the instructions MTSR, . 
MTSRIM, and MFSR. This section describes the general-purpose and special-purpose 
registers. 


General-Purpose Registers 


The microcontroller incorporates 192 general-purpose registers. The organization of the 
general-purpose registers is diagrammed in Figure 2-1. 


General-purpose registers hold the following types of operands for program use: 
m 32-bit addresses 

32-bit signed or unsigned integers 

32-bit branch-target addresses 

32-bit logical bit strings 


8-bit signed or unsigned characters 
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Figure 2-1 General-Purpose Register Organization 
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- 16-bit signed or unsigned integers 
Word-length Booleans 


ai 
a 
m Single-precision floating-point numbers 
= 


Double-precision floating-point numbers (in two sscieiak locations) 


Because a large number of general-purpose registers are provided, a large amount of 
frequently used data can be kept on-chip, where access time is fastest. 


Instructions can specify two general-purpose registers for source operands and one 
general-purpose register for storing the instruction result. These registers are specified 
by three 8-bit instruction fields containing register numbers. A register may be specified 
directly by the instruction, or indirectly by one of three special-purpose registers. 


Register Addressing 


The general-purpose registers are partitioned into 64 global registers and 128 local 
registers, differentiated by the most significant bit of the register number. The distinction 
between global and local registers is the result of register-addressing considerations. 


The following terminology is used to describe the addressing of general- purpose 
registers: 


m= Register number, a software-level number for a general-purpose register. For 
example, this is the number contained in an instruction field. Register numbers range 
from 0 to 255. | 


m= Global-register number, a software-level number for a global register. Global-register 
numbers range from 0 to 127. 


m Local-register number, a software-level number for a local register. Local-register 
numbers range from 0 to 127. ~ 


m Absolute-register number, a hardware-level number used to select a general-purpose 
register in the register file. Absolute-register numbers range from 0 to 255. — 


Global Registers 


When the most significant bit of a register number is 0, a global register is selected. The 
seven least significant bits of the register number give the global-register number. For 
global registers, the absolute-register number is equivalent to the register number. 


Global registers 2 through 63 are not implemented. An attempt to access these registers 
yields unpredictable results; however, they may be protected from User-mode access by 
the Register Bank Protect Register (see Section 6.2.1). 


The register numbers associated with Global Registers 0 and 1 have special meaning. 
The number for Global Register 0 specifies that an indirect pointer is to be used as the 


~ source of the register number (see Section 2.3); there is an indirect pointer for each of 


the instruction operand/result registers. Global Register 1 contains the Stack Pointer, 
which ts used in the addressing of local registers. 


Local Registers 


When the most significant bit of a register number is 1, a local register is selected. The 
seven least significant bits of the register number give the local-register number. For 
local registers, the absolute-register number is obtained by adding the local-register 
number to bits 8—2 of the Stack Pointer and truncating the result to seven bits; the most 
significant bit of the original register number is unchanged (i.e., it remains a 1). 


Programming 


2.2.1.4 


Bute 


AMD zl 


The Stack Pointer addition applied to local-register numbers provides a limited form of 
base-plus-offset addressing within the local registers. The Stack Pointer contains the 
32-bit base address. This assists run-time storage management of variables for 
dynamically nested procedures (see Chapter 4). 


Local-Register Stack Pointer 


The Stack Pointer is a 32-bit register that may be an operand of an instruction as any 
other general-purpose register. However, a shadow copy of Global Register 1 is 
maintained by processor hardware for use in local-register addressing. This shadow 
copy is set only with the results of Arithmetic and Logical instructions. If the Stack 
Pointer is set with the result of any other instruction class, local registers cannot be 
accessed predictably until the Stack Pointer i is set once again with an Arithmetic or 
Logical instruction. 


A modification of the Stack Pointer has a delayed effect on the addressing of local 
registers, as discussed in Section 5.6. 


Special-Purpose Registers 
The microcontroller contains 24 special-purpose registers. The organization of the 
special-purpose registers is shown in Figure 2-2. 


Special-purpose registers provide controls and data for certain processor operations. 
Some special-purpose registers are updated dynamically by the processor, independent 


. of software controls. Because of this, a read of a special-purpose register following a 


write does not necessarily get the data that was written. 


Some special-purpose registers have fields reserved for future processor implementa- 
tions. When a special-purpose register is read, a bit in a reserved field is read as a O. An 
attempt to write a reserved bit with a 1 has no effect; however, this should be avoided 
because of upward-compatibility considerations, except for bits 5 and 6 of the Current 
Processor Status Register. These bits are used to disable address translation in other 29K 
processors and may be written with 1 in the Am29200 and Am29205 microcontrollers. 


The special-purpose registers are accessed by explicit data movement only. Instructions 
that move data to or from a special-purpose register specify the special-purpose register 
by an 8-bit field containing a special-purpose register number. Register numbers are 
specified directly by instructions. 


The special-purpose registers are partitioned into protected and unprotected registers. 
Special-purpose registers numbered 0-127 and 160-255 are protected (note that not all 
of these are implemented). Special-purpose registers numbered 128-159 are unpro- 
tected (again, not all are implemented). 


Protected special-purpose registers numbered 0-127 are accessible only by programs 
executing in the Supervisor mode. An attempted read or write of a special-purpose 
register by a User-mode program causes a Protection Violation trap to occur. Special- 
purpose registers numbered 160-255, though architecturally unprotected, are not 
accessible by programs in the User mode or the Supervisor mode. These register 
numbers are reserved for virtual registers in the arithmetic architecture, and any 
attempted access causes a Protection Violation trap. 


The Floating-Point Environment Register, Integer Environment Register, and Floating- 
Point Status Register are not implemented in processor hardware. These registers are 
implemented via the virtual arithmetic interface provided on the Am29200 and Am29205 
microcontrollers (see Section 2.8). 
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Figure 2-2. Special-Purpose Registers 

















Register Number Protected Registers Mnemonic 
0 VAB 
1 OPS 
2 CPS 
3 CFG 
4 CHA 
5 CHD 
6 CHC 
7 RBP 
8 TMC 
9 TMR 
10 [Program Gountero._| PCO 
1 PCr 
12 pc2 

Unprotected Registers 
128 IPC 
129 IPA 
130 IPB 
191. |e || @ 
132 ALU 
133 BP 
134 FC 
135 CR 
160 FPE 
161 INTE 
162 Floating-Point Status (virtual) FPS | 





An attempted read of an unimplemented special-purpose register yields an unpredict- 
able value. An attempted write of an unimplemented, protected special-purpose register 
has an unpredictable effect on processor operation, unless the write causes a Protection 
Violation. An attempted write of an unimplemented, unprotected special-purpose register 
has no effect; however, this should be avoided because of upward-compatibility 
considerations. 


2.3 ADDRESSING REGISTERS INDIRECTLY 


Specifying Global Register O as an instruction operand register or result register causes 
an indirect access to the general-purpose registers. In this case, the absolute-register 
number is provided by an indirect pointer contained in a special-purpose register. 


Each of the three possible registers for instruction execution has an associated 8-bit 
indirect pointer. Indirect register numbers can be selected independently for each of the 
three operands. Since the indirect pointers contain absolute-register numbers, the number 
in an indirect pointer is not added to the Stack Pointer when local registers are selected. 


The indirect pointers are set by the MTSR, MTSRIM, SETIP, and EMULATE instructions, 
and by floating-point, MULTIPLY, MULTM, MULTIPLU, MULTMU, DIVIDE, and DIVIDU 
instructions. 
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For a move-to-special-register instruction, an indirect pointer is set with bits 9-2 of the 
32-bit source operand. This provides consistency between the addressing of words in 
general-purpose registers and the addressing of words in external devices or memories. A 
modification of an indirect pointer using a move-to-special-register instruction has a 
delayed effect on the addressing of general-purpose registers (see Section 5.6). 


For the remaining instructions, all three indirect pointers are set simultaneously with the 
absolute-register numbers derived from the register numbers specified by the instruc- 
tion. For any local registers selected by the instruction, the Stack-Pointer addition is 
applied to the register numbers before the indirect pointers are set. 


Except when an indirect pointer is set by a move-to-special-register instruction, register 
numbers stored into the indirect pointers are checked for bank-protection violations at 
the time the indirect pointers are set. 


indirect Pointer C Register (IPC, Register 128) 


This unprotected special-purpose register (Figure 2-3) provides the RC-operand register 
number (see Section 18.3) when an instruction RC field has the value zero (i.e., when 
Global Register 0 is specified). 


Indirect Pointer C Register 


31 23 15 7 


0 


Bits 31-10: Reserved 


Bits 9-2: Indirect Pointer C (IPC)—The 8-bit IPC field contains an absolute-register 
number for a general-purpose register. This number directly selects a register. (Stack- 
Pointer addition is not performed in the case of local registers.) 


Bits 1-0: Zeros—The IPC field is aligned for compatibility with word addresses. 


indirect Pointer A Register (IPA, Register 129) 


This unprotected special-purpose register (Figure 2-4) provides the RA-operand register 
number (see Section 18.3) when an instruction RA field has the value zero (i.e., when 
Global Register 0 is specified). 


Indirect Pointer A Register 
31 23 15 7 


© 


Reserved IPA 


Bits 31-10: Reserved 


Bits 9-2: Indirect Pointer A (IPA)—The 8-bit IPA field contains an absolute-register 
number for either a general-purpose register or a local register. This number directly 
selects a register. (Stack-Pointer addition is not performed in the case of local! registers.) 


Bits 1-0: Zeros—The IPA field is aligned for compatibility with word addresses. 
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Figure 2-5 


Indirect Pointer B Register (IPB, Register 130) 


This unprotected special-purpose register (Figure 2-5) provides the RB-operand register 
number (see Section 18.3) when an instruction RB field has the value Zero (i.e., when 
Global Register 0 is specified). | 


Indirect Pointer B Register 
31 23 15 7 fe) 


2.4 
2.4.1 


Figure 2-6 
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Bits 31-10: Reserved 


Bits 9—2: Indirect Pointer B (IPB)—The 8-bit IPB field contains an absolute-register 
number for a general-purpose register. This number directly selects a register. (Stack- 
Pointer addition is not performed in the case of local registers.) 


Bits 1-0: Zeros—The IPB field is aligned for compatibility with word addresses. 


INSTRUCTION ENVIRONMENT 


This section describes the special-purpose registers that affect the execution of Floating- 
Point and Integer Arithmetic instructions. 


Floating-Point Environment Register (FPE, Register 160) 


_ This unprotected special-purpose register (Figure 2-6) contains control bits that affect 


the execution of floating-point operations. This register is not implemented directly by 
processor hardware, but is implemented by the virtual arithmetic software. 


Floating-Point Environment Register 





Bits 31-9: Reserved 


Bit 8: Fast Float Select (FF)—The FF bit being 1 enables fast floating-point operations, 
in which certain requirements of the IEEE floating-point specification are not met. This 
improves the performance of certain operations by sacrificing conformance to the IEEE 
specification. 


Bits 7-6: Floating-Point Round Mode (FRM)—This field specifies the default mode 


used to round the results of floating-point operations, as follows: 


FRM1—0 Round Mode 


00 Round to nearest 
01 Round to —°° 

10 Round to + ce 

11 Round to zero 
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Bit 5: Floating-Point Divide-By-Zero Mask (DM)—If the DM bit is 0, a Floating-Point 
Exception trap occurs when the divisor of a floating-point division operation is zero and 
the dividend is a non-zero, finite number. If the DM bit is 1, a Floating-Point Exception 
trap does not occur for divide-by-zero. 


Bit 4: Floating-Point Inexact Result Mask (XM)—If the XM bit is 0, a Floating-Point 
Exception trap occurs when the result of a floating-point operation is not equal to the 
infinitely precise result. If the XM bit is 1, a Floating-Point Exception trap does not occur 
for an inexact result. | 


Bit 3: Floating-Point Underflow Mask (UM)—lIf the UM bit is 0, a Floating-Point 
Exception trap occurs when the result of a floating-point operation is too small to be 
expressed in the destination format. If the UM bit is 1, a Floating-Point Exception trap 
does not occur for underflow. 


Bit 2: Floating-Point Overflow Mask (VM)—lf the VM bit is 0, a Floating-Point Excep- 
tion trap occurs when the result of a floating-point operation is too large to be expressed 
in the destination format. If the VM bit is 1, a Floating-Point Exception trap does not 
occur for overflow. 


Bit 1: Floating-Point Reserved Operand Mask (RM)—If the RM bit is 0, a Floating- 
Point Exception trap occurs when one or more input operands to a floating-point 
operation is a reserved value, or when the result of a floating-point operation is a 
reserved value. If the RM bit is 1, a Floating-Point Exception trap does not occur for 
reserved operands. 


Bit 0: Floating-Point Invalid Operation Mask (NM)—If the NM bit is 0, a Floating-Point 
Exception trap occurs when the input operands to a floating-point operation produce an 
indeterminate result (e.g., o times 0). If the NM bit is 1, a Floating-Point exeeption trap 
does not occur for invalid operations. 


Integer Environment Register (INTE, Register 161) 


This unprotected special-purpose register (Figure 2-7) contains control bits that affect 
the execution of integer multiplication and division operations. This register is not 

implemented directly by processor hardware, but is implemented by the virtual arithmetic 
interface. — 


Integer Environment Register 


31 23 15 7 0 

peere eee ag nda 
| Do! 

MO 


Bits 31-2: Reserved 


Bit 1: Integer Division Overflow Mask (DO)—'f the DO bit is 0, an Out-of-Range trap 
occurs when overflow of a signed or unsigned 32-bit result occurs during a DIVIDE or 
DIVIDU instruction, respectively. If the DO bit is 1, an Out-of-Range trap does not occur 
for overflow during integer divide operations. 


The DIVIDE and DIVIDU instructions always cause an Out-of- Range Trap upon division 
by zero, regardless of the value of the DO bit. 
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Bit 0: Integer Multiplication Overflow Exception Mask (MO)—If the MO bit is 0, an 
Out-of-Range trap occurs when overflow of a signed or unsigned 32-bit result occurs 
during a MULTIPLY or MULTIPLU instruction, respectively. If the MO bit is 1, an 
Out-of-Range trap does not occur for overflow during integer multiply operations. 
Because 64-bit results cannot overflow, this bit should be set to 1 when obtaining a 
64-bit result for multiplication to avoid Out-of-Range traps. 


STATUS RESULTS OF INSTRUCTIONS 


This section discusses the status information generated by arithmetic, logical and 
floating-point operations, and the special registers that contain this status information. 


ALU Status Register (ALU, Register 132) 


This unprotected special-purpose register (Figure 2-8) holds information about the 
outcome of Arithmetic/Logic Unit (ALU) operations as well as control for certain opera- 
tions performed by the execution unit. 


ALU Status Register 





DF 


Bits 31-12: Reserved 


Bit 11: Divide Flag (DF)—The DF bit is used by the instructions that implement division. 
This bit is set at the end of the division instructions either to 1 or to the complement of 
the 33rd bit of the ALU. When a Divide Step instruction is executed, the DF bit deter- 
mines whether an addition or subtraction operation is performed by the ALU. 


Bit 10: Overflow (V)—The V bit indicates that the result of a signed, two’s-complement 
ALU operation required more than 32 bits to represent the result correctly. The value of 
this bit is determined by exciusive-ORing the ALU carry-out with the carry-in to the most 
significant bit for signed, two’s-complement operations. This bit is not used for any 
special purpose in the processor and is provided for information only. 


Bit 9: Negative (N)—The N bit is set with the value of the most significant bit of the 
result of an arithmetic or logical operation. If two’s-complement overflow occurs, the N bit 
does not reflect the true sign of the result. This bit is used in divide operations. 


Bit 8: Zero (Z)—The Z bit indicates that the result of an arithmetic or logical operation is 
zero. This bit is not used for any special purpose in the processor, and is provided for 
information only. 


Bit 7: Carry (C)—The C bit stores the carry-out of the ALU for arithmetic operations. It is 
used by the add-with-carry and subtract-with-carry instructions to generate the carry into 
the Arithmetic/Logic Unit. 


Bits 6-5: Byte Pointer (BP)—The BP field holds a 2-bit pointer to a byte within a word. 
It is used by Insert Byte and Extract. Byte instructions. 
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The most significant bit of the BP field is used to determine the position of a half-word 
within a word for the Insert Half-Word, Extract Half-Word, and Extract Half-Word, 
Sign-Extended instructions. | 


The BP field is set by a Move To Special Register instruction with either the ALU Status 
Register or the Byte Pointer Register as the destination. It is also set by a load or store 
instruction if the Set Byte Pointer (SB) bit in the instruction is 1. A load or store sets the 
BP field with 11. 


The Byte Pointer Register (Section 3.1.3) provides direct access to this field. 


Bits 4-0: Funnel Shift Count (FC)—The FC field contains a 5-bit shift count for the 


funnel shifter. The funnel shifter concatenates two source operands into a single 64-bit 
operand and extracts a 32-bit result from this 64-bit operand; the FC field specifies the 
number of bit positions from the most significant bit of the 64-bit operand to the most 
significant bit of the 32-bit result. The FC field is used by the EXTRACT instruction. 


The FC field is set by a Move To Special Register instruction with either the ALU Status 
Register or the Funnel Shift Count Register as the destination. 


Arithmetic Operation Status Results 


The Arithmetic instructions modify the V, N, Z, and C bits. These bits are set according 
to the result of the operation performed by the instruction. 


All instructions in the Arithmetic class—except for MULTIPLY, MULTM, DIVIDE, MULTI- 
PLU, MULTMU, and DIVIDU—perform an add. In the case of subtraction, the subtract is 
performed by adding the two’s-complement or one’s-complement of an operand to the 
other operand. The multiply-step and divide-step operations also perform adds, again 
possibly complementing one of the operands before the operation is performed. In 
general, the status bits are based on the results of the add. 


If two’s-complement overflow occurs during the add, the V bit of the ALU Status Register 
is set; otherwise it is reset. Two’s-complement overflow occurs when the carry-in to the 
most significant bit of the intermediate result differs from the carry-out. When this occurs, 
the result cannot be represented by a signed word integer. Note that the V bit always is 
set in this manner, even when the result is unsigned. 


The N bit of the ALU Status Register is set to the value of the most significant bit of the 
result of the add. Note that the divide-step and multiply-step operations may shift the 
result after the operation is performed. In the cases where shifting occurs, the N bit may 
not agree with the result that is written into a general-purpose register, since the N bit is 
based only on the result of the add, not on the shift. 


If the result of the add causes a zero word to be written to a general-purpose register, 
the Z bit of the ALU Status Register is set; otherwise, it is reset. The Z bit always reflects 
the result written into a general-purpose register; if shifting is performed by a multiply or 
divide step, the Z bit reflects the shifted value. 


lf there is a carry out of the add operation, the C bit is set; otherwise it is reset. 


Logical Operation Status Results 


The Logical instructions modify the N and Z bits. These bits are set according the result 
of the instruction. The V and C bits are meaningless in regard to the nears, instructions, 
so they are not modified. 
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Figure 2-9 


The N bit of the ALU Status Register is set to the value of the most significant bit of the 
result of the logical operation. 


If the result of the logical operation is a zero word, the Z bit of the ALU Status Register is 
set; otherwise, it is reset. 


Floating-Point Status Results 


The Floating-Point instructions check for a number of exceptional conditions, and enon 
these exceptions by setting bits of the Floating-Point Status Register. The exceptional 
conditions may also cause traps, depending on the state of mask bits in the Floating- 
Point Environment Register. There are two groups of status bits in the Floating-Point 
Status Register: trap status bits and sticky status bits. When an exception is detected, 
the virtual arithmetic processor on the microcontroller sets the trap status bit and/or the 
sticky status bit associated with the exception, depending on the corresponding excep- 
tion mask bit and on whether or not a trap occurs. The sticky status bit is set whenever 
the corresponding exception is masked, regardless of whether or not a trap occurs. A 
trap status bit is set whenever a trap occurs, regardless of the state of the corresponding 
mask bit. 


A trap status bit is reset when a trap occurs and the indicated status does not apply to 
the trapping operation. A sticky status bit is reset only by software. 


Floating-Point Status Register (FPS, Register 162) 


This unprotected special-purpose register (Figure 2-9) contains status bits indicating the 
outcome of floating-point operations. This register is not implemented directly by 
processor hardware, but is implemented by the virtual arithmetic software. 


The floating-point status bits are divided into two groups. The first group consists of the 
sticky status bits (DS, XS, US, VS, RS, and NS), which, once set, remain set until 
explicitly cleared by a Move-to-Special-Register (MTSR) or Move-to-Special-Register- 
Immediate (MTSRIM) instruction. Only those sticky status bits corresponding to masked 
exceptions are updated. The update occurs at the end of instruction execution. 


The second group consists of the trap status bits (DT, XT, UT, VT, RT, and NT) that 
report the status of an operation for which a Floating-Point Exception trap is taken. 
These bits are updated only by an operation that takes a trap as a result of an un- 
masked Floating-Point Exception; all other operations leave these bits unchanged. A 
trap status bit is updated regardless of the state of the corresponding exception mask in 
the Floating-Point Environment Register. 


ea Status | 


Bits 31-14: Reserved 


Bit 13: Floating-Point Divide-By-Zero Trap (DT)—The DT bit is set when a Floating- 
Point Exception trap occurs and the associated floating-point operation is a divide with a 
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zero divisor and a non-zero, finite dividend. Otherwise, this bit is reset when a Floating- 
Point Exception trap occurs. 


Bit 12: Floating-Point Inexact Result Trap (XT)—The XT bit is set when a Floating- 
Point Exception trap occurs and the result of the associated floating-point operation is 
not equal to the infinitely-precise result. Otherwise, this bit is reset when a Floating-Point 
Exception trap occurs. 


Bit 11: Floating-Point Underflow Trap (UT)—The UT bit is set when a Floating-Point 
Exception trap occurs and the result of the associated floating-point operation is too 
small to be expressed in the destination format. Otherwise, this bit is reset when a 
Floating-Point Exception trap occurs. 


Bit 10: Floating-Point Overflow Trap (VT)—The VT bit is set when a Floating-Point 
Exception trap occurs and the result of the associated floating-point operation is too 
large to be expressed in the destination format. Otherwise, this bit is reset when a 
Floating-Point Exception trap occurs. 


Bit 9: Floating-Point Reserved Operand Trap (RT)}—The RFT bit is set when a 
Floating-Point Exception trap occurs and the result of the associated floating-point 
operation is a reserved value. Otherwise, this bit is reset when a Floating-Point Excep- 
tion trap occurs. . 


Bit 8: Floating-Point Invalid Operation Trap (NT)—The NT bit is set when a Floating- 
Point Exception trap occurs and the input operands to the associated floating-point 
operation produce an indeterminate result. Otherwise, this bit is reset when a Floating- 
Point Exception trap occurs. 


Bits 7-6: Reserved 


Bit 5: Floating-Point Divide-By-Zero Sticky (DS)—The DS bit is set when the DM bit 
of the Floating-Point Environment Register is 1, the divisor of a floating-point division 
operation is a zero, and the dividend is a non-zero, finite number. 


Bit 4: Floating-Point Inexact Result Sticky (XS)—The XS bit is set when the XM bit of 
the Floating-Point Environment Register is 1 and the result of a floating-point operation 
is not equal to the infinitely precise result. 


Bit 3: Floating-Point Underflow Sticky (US)—The US bit is set when the UM bit of the 
Floating-Point Environment Register is 1 and the result of a floating-point operation is 
too small to be expressed in the destination format. 


Bit 2: Floating-Point Overflow Sticky (VS)—The VS. bit is set when the VM bit of the 
Floating-Point Environment Register is 1 and the result of a floating-point operation is 
too large to be expressed in the destination format. 


Bit 1: Floating-Point Reserved Operand Sticky (RS)}—The RS bit is set when the RM 
bit of the Floating-Point Environment Register is 1 and either one or more input oper- 
ands to a floating-point operation is a reserved value or the result of a floating-point 
operation is a reserved value. 


Bit 0: Floating-Point Invalid Operation Sticky (NS)—The NS bit is set when the NM 
bit of the Floating-Point Environment Register is 1 and the input operands to a floating- 
point operation produce an indeterminate result. 


INTEGER MULTIPLICATION AND DIVISION 


The Am29200 and Am29205 microcontrollers do not directly support the instructions 
MULTIPLU, MULTMU, MULTIPLY, MULTM, DIVIDE, and DIVIDU. The processor is 
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capable of performing these instructions as a sequence of multiply or divide steps, which 
are directly supported by hardware. A special register, Q, is used in conjunction with the 
SRCA and SRCB operands to execute the multiply or divide step. This section describes 
the Q register and discusses the general method for multiplication and division. 


Q Register (Q, Register 131) 
The Q Register is an unprotected special-purpose register (Figure 2-10). 


Q Register 


31 23 15 7 0 


Bits 31-0: Quotient/Multiplier (Q)—During a sequence of divide steps, this field holds 
the low-order bits of the dividend; it contains the quotient at the end of the divide. During 
a sequence of multiply steps, this field holds the multiplier; the field contains the 
low-order bits of the result at the end of the multiply. 


For an integer divide instruction, the Q field contains the high-order bits of the dividend 
at the beginning of the instruction, and contains the remainder upon completion of the 


instruction. 


Multiplication 

The processor performs integer multiplication by a series of multiply-step instructions. 
Note that when the product of a constant and a variable is to be computed, a more © 
efficient sequence of shift and add instructions can usually be found. Many compilers 
use this technique automatically. 


If a program requires the multiplication of two integers, the required sequence of multiply 
steps may be executed in-line or executed in a multiply routine called as a procedure. It 
may be beneficial to precede a full multiply procedure with a routine to discover whether 
or not the number of multiply steps may be reduced. This reduction is possible when the 
operands do not use all of the available 32 bits of precision. 


The following routine multiplies two 32-bit signed integers, giving a 64-bit result. 
Unsigned multiplication can be performed by substituting the MULU instruction for the 
MUL and MULL instructions. 


; 32 bit * 32 bit —>64 bit signed multiply 
; Input: = multiplicand in Ir2, multiplier in Ir3 
; Output: result most significant word in gr96, result least significant word in gr97 


SMul64: 


misr Q, Ir3 ; put multiplier in the Q register 

mul gr96, Ir2, 0 ; perform initial multiply step 

rep 30 ; expand out 30 copies of the next instruction 
; in-line | 

mul gr96, Ir2, gr96 ; total of 30 more multiply steps 

.endr 3 

mull gr96, Ir2,gr96 ; perform last sign correcting step 

mfsr §gr97,Q ; get the least significant result word 


The following routine multiplies two 32-bit integers, returning a 32-bit result. It attempts 
to minimize the number of multiply-step instructions by checking the input operands. It is 
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coded as a subroutine, with pointers to its operands passed in the indirect pointers IPC, 
IPA, and IPB. This allows the routine to operate on any combination of registers, rather 
than forcing the operands to be in fixed registers. 


; 32 bit * 32 bit —> 32 bit signed or unsigned multiply called by: 


call tpc, MUL32 : call the multiply routine 
: setip dst_reg, src1_reg, src2_reg ; passing pointers to the operand registers 
; ; in the delay slot 


; Input: § operands in the registers pointed to by indirect-pointer registers IPA and IPB 
; Output: result least significant word in the register pointed to by IPC 
; Used: __ return address in tpc, special registers Q and FC 
; Destroy: previous contents of registers tpc, Temp0 — Temp2 
; Symbolic register names: 
eg Tempo, gr116 
reg Tempt, gr119 
reg Temp2, gri20 
eg tpc, gr122 . 
.word 0x00200000 ; Debugger tag word 


Mul32: 
; need an instruction to separate SETIP (probably last instruction) from access of indirect 
; pointers 

mtsrim FC,8 ; useful when one operand is 8-bit 

or Tempo, grO, O ; copy value of IPA register 


; next check to see that the operand with the most leading zeros becomes the multiplier 
cpgtu Temp1,gr0,grO 


jmpf Temp1,do8 ; the operands are already ordered correctly 
or Temp1,Temp1,gr0 ; if it jumps, Temp1 holds 0, so this copies 
| ; the value of the IPB register 
const Temp0,0 ; swap the operands 
or Temp0, Temp0,gr0 
or Temp1,gr0,0 
dos: 
cpleu Temp2, Temp1 ,Ox7f ; less than 8 bits? 
jmpf Temp2,do16 ; no, check for 16 bits 


mtsr Q,TempO 
mulu Temp0,Temp1,0 


rep 7 ; expand out 7 copies of the next instruction 
; in-line 

mutlu Tempo, femp1,Temp0 ; total of 7 more multiply steps 

.endr . 


; the top 24 bits of the result are in the lower 24 bits of TempO, and the bottom 8 bits are in the 
> top of Q 
mfsr Temp1,Q 


jmpi tpc ; return to the calling routine 
extract gr0,femp0,femp1 ; extract the result in the delay-slot of the 
; jump 
do16: 
const Temp2,0x7fff ; less than 16 bits? 
cplequ) Temp2,Temp0,Temp2 | 
jmpf Temp2,do32 ; no, perform all 32 steps 
mulu Temp0, Temp1,0 ; perform initial multiply-step 
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mulu 
.endr 


15 ; expand out 15 copies of next instruction 
; in-line 
Tempo, Temp1, Tempo ; total of 15 more multiply-steps 


; the top 16 bits of the result will be in the lower 16 bits of Tempo, the bottom 16 bits in the top 


-of Q 
mtsrim 
mfsr 


jmpi 
extract 


do32: 
mulu 


rep 


mulu 
.endr 


jmpi 
mfsr 


2.6.3 Division 


FC,16 ; extract on bit-16 boundary 

Temp1,Q 

tpc ; return to the calling routine 

gr0, Temp0,Tempt1 ; extracting the result in the delay-slot of the 
; Jump 

temp0,Temp1,0 ; perform initial step 

31 ; expand out 32 copies of the next instruction 
; in-line 

Temp0O, Temp1, femp0O ; total of 31 more multiply steps 

tpc | ; teturn to calling routine 

gr0,Q ; Copy the result to the return register in the 
; delay slot 


The processor performs integer division by a series of divide-step instructions. When the — 
divisor is a power of 2 and the dividend is unsigned, the divide should be accomplished 


by a right shift. 


If a program requires the division of two integers, the required sequence of divide steps 
may be executed in-line or executed in a divide routine called as a procedure. It may be 
beneficial to precede a full divide procedure with a routine to discover whether or not the 
number of divide steps may be reduced. This reduction is possible when the operands 
do not use all of the available 32 bits of precision. 


The following routine divides a 64-bit, unsigned dividend by a 32-bit unsigned divisor. . 


; 64 bit / 32 bit > 32 bit unsigned divide 

; Input: = most significant dividend word in Ir2, least significant dividend word in Ir3, 
: divisor in Ir4 

; Output: quotient in gr96, remainder in gr97 


UDiv64: 
mtsr 


divO 
ep 


div 
.endr 
divi 


divrem 
mfsr 


Q, Ir3 | ; put least significant word of the dividend in 
; the Q register 

gr97, Ir2 ; perform initial divide step 

31 | | ; expand out 31 copies of the next 
; Instruction in-line 

gr97, gr97, Ir4 ; total of 30 more divide steps 

gr97, gr97, Ir4 ; perform last step 

gr97, gr97, Ir4 ; compute remainder 

gr96, Q ; get the quotient 


The following routine divides a 32-bit unsigned dividend by a 32-bit unsigned divisor. 


; 32 bit / 32 bit > 32 bit unsigned divide 
; Input: — dividend word in Ir2, divisor in Ir4 
; Output: quotient in gr96, remainder in gr97 
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UDiv32: 
mtsr 
divO 


rep 


div 
.endr 
divl 


divrem 
mfsr 


Q, !r2 

gr97, 0 

31 

gr97, gr97, Ir4 
gr97, gr97, Ir4 


gr97, gr97, Ir4 
gr96, Q 
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; put the dividend in the Q register 
; perform initial divide step, zeroing out 
; the upper bits of the dividend 


; expand out 31 copies of the next 
; instruction in-line 
; total of 30 more divide steps 


; perform last step 
; compute remainder 
; get the quotient 


The following routine divides a 32-bit signed dividend by a 32-bit signed divisor. It also 
traps division by zero. Because the divide-step instructions only operate on unsigned 
operands, extra code is required to perform sign checking and conversion. 


; 32 bit / 32 bit signed divide, called by: 


: call 

; setip 
~ § Input: 

; Output: 

; Used: 


tpc, SDiv32 


dst_reg, src1_reg, src2_reg 


; Call the divide routine 


; passing pointers to the operand 


registers in the delay slot 


dividend and divisor in the registers pointed to by the indirect-pointer 
registers IPA and IPB 
result quotient in the register pointed to by IPC, remainder left in TempO 
return address in tpc, special register Q 


; Destroyed: previous contents of registers tpc, TempO — Temp2 
; Symbolic register names: 


eg 
eg 
reg 
.feg 
.word 


SDiv32: 
const 
asneq 


add 
jmpf 
add 
const 
subr 


pdividend: 
jmpf 
mtsr 


xor 
subr 


pdivisor: 
divO 


rep — 


div 
.endr 


divI 


Tempo, gr116 
Temp1, gr119 
Temp2, gr120 
tpc, gr122 
0x00200000 


Temp1, 0 


V_DIVBYZERO, Temp1, gr0 


Tempo, gr0, 0 
Tempo, pdividend 
Temp2, Temp1, grQ 
Temp1, 3 

Tempo, Tempo, 0 


Temp2, pdivisor 
Q, TempO 


Temp1, Temp1, 1 
Temp2, Temp2, 0 


Tempo, 0 
31 


Temp0O, TempO, Temp2 


Temp0, Temp0, Temp2 
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check for divide by zero with an assert 
get dividend from indirect pointer 

is it negative? (jmpf is also “jmppos”) 
get divisor from indirect pointer 

set negative result and remainder flags 


; make dividend positive 


is divisor negative? 

copy dividend to Q register in delay slot 
of the jump 

turn off negative result flag 

make divisor positive 


initialize 
expand out 31 copies of the next 


instruction in-line 
total of 30 more divide steps 


perform last divide step 
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divrem Temp0, TempO, Temp2 ; get positive remainder 
mfsr Temp2, Q get positive quotient 


sll Temp1, Temp1, 30 ; Copy negative remainder flag to test bit 
jmpf Temp1, premainder ; If itis not set, remainder is ok 
sll Temp1, Temp1, 1 ; Copy negative result flag to test bit 


subr Temp0, Temp0, 0 negate remainder 


premainder: 
jmpfi Temp1, tpc 
add gr0, Temp2, 0 


return to caller if result is positive 
copying quotient to the result register 
in the delay slot 

else return to caller, 

negating the quotient in the delay slot 


_jmpi tpc 
subr gr0, Temp2, 0 
I NEED AN INSTRUCTION FOR... 


This section discusses topics of general concern in the implementation of application 
programs. 


we we we we we 


Run-Time Checking 


The assert instructions provide programs with an efficient means of comparing two 
values and causing a trap when a specified relation between the two values is not 
satisfied. The instructions assert that some specified relation is true and trap if the 
relation is not true. This allows run-time checking, such as checking that a computed 
array index is within the boundaries of the storage for an ane to be performed with a 
minimum performance penalty. 


Assert instructions are available for comparing two signed or unsigned operands. The 
following relations are supported: equal-to, not-equal-to, less-than, less-than-or-equal-to, 
greater-than, and greater-than-or-equal-to. 


The assert instructions specify a vector number for the trap. However, only vector 


-numbers 64 through 255 (inclusive) may be specified by User-mode programs. If a 


User-mode assert instruction causes a trap and the vector number is between 0 and 63 
inclusive, a Protection Violation trap occurs, instead of the specified trap. 


Since the assert instructions allow the specification of the vector number, several traps 
may be defined in the system for different situations detected by the assert instructions. 


Operating-System Calls 


An applications program can request a service from the operating system by using the 
following instruction: 


asneq System_Routine, grt, gr 


This instruction always creates a trap since it attempts to assert that the content of a 
register is not equal to itself (the register number used here is irrelevant, as long as the 
register is otherwise accessible). 


The System_Routine vector number specified by the instruction invokes the execution of 
the operating system routine that provides the requested service. This vector number 
may have any value between 64 and 255, inclusive (vector numbers 0 through 63 are 
pre-defined or reserved). Thus, as many as 192 different operating-system routines may 
be invoked from the applications program. 


In cases where the indirect pointers may be used, the EMULATE instruction allows two 
operand/result registers to be specified to the opera: system routine. The instruction 
is as follows: . 
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emulate System_Routine, Ir3, Ir6é 


In this case, the System_Routine vector number performs the same function as in the 
previous example. Here, however, LR3 and LR6 are specified as operand registers 
and/or result registers (these particular registers are used only for illustration). The 
operating-system routine has access to these registers via the indirect pointers, which 
allows flexible communication. 


Multiprecision Integer Operations 


The processor allows the Carry (C) bit of the ALU Status Register to be used as an 
operand for add and subtract instructions. This provides for the addition and subtraction 
of operands that are greater than 32 bits in length. For example, the following code 
implements a 96-bit addition with signed overflow detection. 


add Ir7, gr96, Ir2 
addc=—swtr8, «=o gr 97, 13 
addcs_ [r9, gr98, Ir4 


Global registers GR96—GR98 contain the first operand, local registers LR2—LR4 contain 


the second operand, and local registers LR7—LR9 contain the result. The first two add 


instructions (ADD and ADDC) set the C bit, which is used by the second two instructions _ 


(ADDC and ADDCS). If the addition causes a signed overflow, then an Out-of-Range 
trap occurs; overflow is detected by the final instruction. 


Complementing a Boolean 


To complement a Boolean in the processor’s format, only the most significant bit of the 
Boolean word should be considered, since the least significant 31 bits may or may not 
be zeros. This is accomplished by the following instruction: 


cpge gr96, gr96, 0 


The Boolean is in GR96 in this example. This instruction is based on the observation 
that a Boolean TRUE is a negative integer, since the Boolean bit coincides with the 
integer sign bit. If the operand of this instruction is a negative integer (i.e., TRUE), the 
result is the Boolean FALSE. If the operand is non-negative (i.e., the Boolean FALSE), 
the result is TRUE. Note that this instruction clears the least significant 31 bits. 


Large Jump and Call Ranges 


‘The 16-bit relative branch displacement provided by processor instructions is sufficient in 


the majority of cases. However, addresses with a greater range are occasionally 
needed. In these cases, the CONST and CONSTH instructions generate the large 
branch-target address in a register. An indirect jump or call then uses this address to 
branch to the appropriate location. , 


NO-OPs 


When a NO-OP is required for proper operation (@.g., as described in Section 5.6), it is 
important that the selected instruction not perform any operation, regardless of program 
operating conditions. For example, the NO-OP cannot access general-purpose registers 
because a register may be protected from access in some situations. The suggested 
NO-OP is: 

aseq 0x40, gri, gr 
This instruction asserts that the Stack Pointer (GR1) is equal to itself. Since the asser- 
tion is always true, there is no trap. Note also that the Stack Pointer cannot be protected, 
and that the assert instruction cannot affect any processor state. 
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2.8 VIRTUAL ARITHMETIC PROCESSOR 


In order to be object-code compatible with present and future implementations of the 
29K Family of microprocessors, the Am29200 and Am29205 microcontrollers provide 
Virtual arithmetic software. A virtual implementation is the means by which a processor 
appears to perform functions that it does not actually perform. In the case of the 
Am29200 and Am29205 microcontrollers, the processor defines arithmetic instructions, 
control, and status which are not directly supported by hardware, but which are imple- 
mented by system software. 





2.8.1 Trapping Arithmetic Instructions 
The processor does not incorporate hardware to directly support floating-point opera- 
tions, nor does it directly support full multiply and divide instructions. However, instruc- 
tions to perform these operations are included in the instruction set. These instructions 
are included for compatibility with processor implementations, such as the Am29050 
microprocessor, that have hardware to perform these operations. 


In application programs that must be fully object-code compatible across several 
processor versions—while taking advantage of the performance of the versions having 
arithmetic hardware—the defined instructions should be i to perform floating-point, 
multiplication, and division operations. 


In the Am29200 and Am29205 microcontrollers, the Floating-Point, CLASS, CONVERT, 
MULTIPLY, MULTM, MULTIPLU, MULTMU, DIVIDE, DIVIDU, and SQRT instructions 
cause traps. The indirect pointers are set at the time the trap occurs, so a trap handler 
can gain access to the operands of the instruction and can determine where the result is 
to be stored. A trap handler can directly emulate the execution of the instruction. 


2.8.2 Virtual Registers 
The processor does not incorporate hardware to directly support the Floating-Point 
Environment Register (FPE), Integer Environment Register (INTE), or Floating-Point 
Status Register (FPS). When one of these registers is referenced by a MTSR/MFSR 
instruction (or a variant), a Protection Violation trap occurs. The Protection Violation trap 
handler must establish that the faulting instruction is a MTSR/MFSR and that the register 
specified by the instruction is one of the registers supported by the virtual interface. This is 
accomplished by obtaining the faulting instruction from memory and examining the 
OPCODE and SRC/DEST fields. The trap handler then simulates the operation of the 
register. 


29 PROCESSOR INITIALIZATION 


When power is first applied to the processor, it is in an indeterminate state and must be 
placed in a known state. Also, under certain circumstances, it may be necessary to place 

the processor in a defined state. This is accomplished by the Reset mode, which places 
the processor into a predefined state. 


2.9.1 Configuration Register (CFG, Register 3) 
This protected special-purpose register (Figure 2-11) controls certain processor and 
system options. The Configuration Register is defined as follows: 





Figure 2-11 Configuration Register 
31 23 15 7 0 
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Bits 31-24: Processor Release Level (PRL)—The PRL field is an 8-bit, read-only 
identification number which specifies the processor version. 


Bits 23-0: Reserved 


2.9.2 Reset Mode | 
The Reset mode is invoked by asserting the RESET input. The Reset mode is entered 
within four processor cycles (MEMCLK cycles) after RESET is asserted. The RESET 
input must be asserted for at least four processor cycles to accomplish a processor 
reset. 








The Reset mode can be entered at any point during operation. If the RESET input is 
asserted at the time power is first applied to the processor, the processor enters the 
Reset mode only after four cycles have occurred on the MEMCLK pin. 


The Reset mode configures the processor state as follows: 


1. Instruction execution is suspended. 

2. Instruction fetching is suspended. 

3. Any interrupt or trap conditions are ignored. 
4 


. The Current Processor Status Register (see Section 16.2.1) is set as shown in 
Figure 2-12. 


5. The Contents Valid (CV) bit of the Channel Control Register (see Section 16.7.2.3) 
is reset. 


Except as previously noted, the contents of all general-purpose registers and special- 
‘purpose registers are undefined. | 


The Reset mode is exited when the RESET input is deasserted. Either four or five cycles 
after RESET is deasserted (depending on internal synchronization time), the processor | 
‘performs an initial instruction access on the external interface. The initial instruction 
access is directed to address 0, which is in ROM Bank 0 after a reset. Setting the 
characteristics of the ROM in Bank 0 during reset is described in Section 8.2.3. 


A processor reset configures the internal peripherals as follows: 


1. In the ROM controller, ROM Bank 0 on the Am29200 microcontroller is configured by 
the BOOTW signal; the boot ROM in ROM Bank 0 on the Am29205 microcontroller is 
always 16 bits wide.The other banks are set so as not to interfere with accesses to 
ROM Bank 0. | 


Figure 2-12 Current Processor Status Register In Reset Mode 
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. The DRAM configuration is not set by a processor reset, DRAM mapping is disabled, 


and the refresh rate is set to the slowest possible value (refresh every 511 MEMCLK 
cycles). 


. The configuration of the peripheral interface adapter is not set by a processor reset. 
. The DMA controller is disabled, DRM fields are reset to 0, and all state machines are 


reset. 


. The POEN field of the PIO Output Enable Register (see Section 12.2.4) is reset to 0, 


making all PIO pins inputs. 


. The parallel port is disabled and all state machines are reset. 
7. The serial port is disabled and all state machines are reset. 
. The video interface is disabled and all state machines are reset. All signals that may 


be either inputs or outputs are configured as inputs. 
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4. PROCEDURE LINKAGE 


This chapter describes the run-time storage organization recommended for the 
Am29200 and Am29205 microcontrollers and describes the use of the local registers to 
improve the performance of procedure calls. The presentation in this chapter is intended 
as a guide for implementing microcontroller software systems, not necessarily as a strict 
definition of how these systems must be implemented. 


Programming languages that use recursive procedures, such as C, generally use a 
stack to store data objects dynamically allocated at run-time. The organization of the 
run-time storage, including the run-time stack, determines how data objects are stored 
and how procedures are called at the machine level. The microcontroller is designed to 
minimize the overhead of calling a procedure, passing parameters to a procedure, and 
returning results from a procedure. This chapter describes the run-time storage orga- 
nization and procedure-calling conventions. 


4.1 RUN-TIME STACK ORGANIZATION AND USE 


A run-time stack consists of consecutive overlapping structures called activation records. 
An activation record contains dynamically allocated information specific to a particular 
activation (or call) of a procedure (such as local data objects). Because of recursion, 
multiple copies of a procedure may be active at any given time. Each active procedure 
has its own unique activation record allocated somewhere on the run-time stack. The 
local variables required by a particular procedure activation are contained in the 
activation record associated with that activation. Thus, the local variables for different 
activations do not interfere with one another. A compiler generates the instructions to 
create and manage the run-time stack, and compiler-generated instructions are based 
on its existence. ; . 


As an example, Figure 4-1 shows three activation records on a run-time stack. This 
stack configuration was generated by procedure A calling procedure B, which in turn 
called procedure C. The fact that procedure C is the currently active procedure is 
reflected by its activation record being on the top of the run-time stack. The Stack 
Pointer points to the top of procedure C’s activation record. 


In Figure 4-1, the storage areas labeled Out args and In args are the outgoing argu- 
ments area (for the caller) or the incoming arguments area (for the callee). These are 
shared between the caller procedure and the callee for the communication of parame- 
ters and results. The areas labeled Locals contain storage for local variables, temporary 
variables (for example, for expression evaluation), and any other items required for the 
proper execution of the procedure. 


4.1.1 Management of the Run-Time Stack 


A run-time stack starts at a high address in memory and grows toward lower memory 
addresses as procedures are called. The bottom of the stack is the location with a high 
address at which the stack starts; the top of the stack is the location with a lower 
address at which the most recent activation record has been allocated. 


When a procedure is called, a new activation record might need to be allocated on the 
run-time stack. An activation record is allocated by subtracting from the stack pointer the 
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Figure 4-1 Run-Time Stack Example 
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When storage for an activation record is allocated, the number of storage locations 
allocated is the sum of the number of locations needed for: 


1. Local variables 
2. Restarting the caller, such as locations for return addresses 


3. Arguments of procedures that may be called in turn by the called procedure (the out- 
going arguments area) 


In some cases storage is not required for one or more of the above items. Also, the 
incoming arguments area, though part of the activation record of the callee, is not 

. allocated storage at this time, because this storage was allocated as the outgoing 
arguments area of the calling procedure. 


An activation record is deallocated, just prior to returning to the caller, by adding to the 
stack pointer the value subtracted during allocation. 


In the Am29200 and Am29205 microcontrollers, run-time storage is actually implement- 
ed as two stacks: the Register Stack and the Memory Stack. Storage is allocated and 
deallocated on these stacks at the same time. The Register Stack stores activation 

~ records associated with all active procedures (except leaf routines, as described later). 
The Memory Stack stores activation-record information that does not fit into the Register 
Stack or that must be kept in memory for other reasons (e.g., because of pointer 
dereferences). Both the Register Stack and the Memory Stack are stored in the external 
data memory. However, a portion of the Register Stack is kept in the processor's local 
registers for performance. The term stack cache in this section refers to the use of the 
local registers to contain a portion of the Register Stack. 
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Register Stack 


The Register Stack contains activation records for active procedures (Figure 4-2). An 
activation record in the Register Stack stores the following information: 


m= input arguments to the called procedure. This portion of the activation record is 
shared between a caller and the callee. It is allocated by the caller as part of the call- 
er’s activation record. 


m The caller's frame pointer. This is the address of the lowest-addressed byte above 
the highest-address word of the caller’s activation record, and is used to manage the 
Register Stack. This portion of the activation record is shared between a caller and 
the callee. It is allocated by the caller as part of the caller’s activation record. 


= The caller’s return address. This is used to resume the execution of the caller after 
the called procedure terminates. This is also part of the caller's activation record. 


= The memory frame pointer. This is the address of the top of the caller’s Memory 
Stack (see below). This address is stored by the callee (if required), and used to re- 
store the memory stack upon return. 


The local variables of the called procedure, if any. 
Outgoing parameters of the called procedure, if any. 
The frame pointer of the called procedure, if the procedure calls another procedure. 


The return address for the called procedure, if the procedure calls another procedure. 
This location is allocated in the Register Stack, and is used when the called proce- 
dure calls another procedure. 
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Figure 4-3 


Local Registers as a Stack Cache 


The Am29200 and Am29205 microcontrollers are designed for efficient implementation 
of the Register Stack. Specifically, each microcontroller can use the large number of 
relatively addressed local registers to cache portions of the Register Stack, yielding a 
significant gain in performance. Allocation and deallocation of activation records occurs 
largely within the confines of the high-speed local registers, and most procedure calls 
occur without external references. Furthermore, during procedure execution, most data 
accesses occur without external references, because activation-record data are 
referenced most frequently. The principle of locality of reference, which allows any cache 
to be effective, also applies to the stack cache. The entries in the stack cache are likely 
to remain there for re-use, because the size of the Register Stack does not change very 
much over long intervals of program execution. Activation records are typically small, so 
the 128 locations in the local register file can hold many activation records. 


Allocating Register-Stack activation records in the local registers is facilitated by the Stack 
Pointer in Global Register 1. During the execution of a procedure, the Stack Pointer points 
simultaneously to the top of the Register Stack in memory and to the local register at the 
top of the stack cache. In other words, Global Register 1, a word-length register, contains 
the 32-bit address of the top of the Register Stack, while bits 8~2 of Global Register 1 
(with a 1 appended to the most significant bit) indicate the absolute register number of 
Local Register 0. Allocation and deallocation of the Register Stack is accomplished by 
subtracting from or adding to, respectively, the value of the Stack Pointer. 


Using this register-addressing scheme, locations from the Register Stack are automati- 
cally mapped into the local register file. Figure 4-3 shows the relationship between the 
Register Stack and the stack cache in the local registers. As shown, pointers are | 
required to define the boundaries between the Register Stack and the stack cache. 
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m The register free bound pointer (fb, gr127) defines the boundary between the portion 
of the Register Stack cached in the local registers and the portion stored in the exter- 
nal data memory. The rfb pointer contains the address of the first word in the Register 
Stack that is not contained in the local registers, but which is in memory. 


w The frame pointer (fp, Ir1) contains the memory address of the lowest-addressed 
word not in the current activation record. The fp is used to determine whether the call- 
er’s complete activation record is contained in the local registers when a procedure 
returns from a call, as described later. 


m@ The register stack pointer (rsp, gr1) points to the top of the Register Stack either in 
the local registers or the memory. The rsp is contained in the focal-register Stack 
Pointer (Global Register 1). The top of the Register Stack may or may not be con- 
tained in the data memony. The rsp simply defines the location of the top of the Regis- 
ter Stack. 


w The register allocate bound pointer (rab, gr126) defines the lowest-addressed stack 
location that can be cached within the local registers. This defines the limit to which 
local registers can be allocated in the Register Stack. 


Several activation records may exist in the Register Stack at any given time, but only 
one stack location may be mapped to a local register at a given time. When the Register 
Stack grows beyond the 128-word capacity of the local registers, some movement of 
data between the stack cache and the Register Stack in data memory must occur. 


Stack overflow occurs when a procedure is called, but the activation record of the callee 
requires more registers than can be allocated in the stack cache (this is detected by 
comparing rsp with rab). Figure 4-4 illustrates stack overflow. In this case, the contents 
of a number of registers must be moved to data memory. The number of registers 
involved must be sufficient to allow the entire activation record of the callee to reside in 
the local registers. A block of the registers is copied, or spilled, into an area of external 
data memory, freeing space in the local register file for the most recent procedure call. 


Stack underflow occurs when a procedure returns to the caller, but the entire activation 
record of the caller is not resident in the stack cache (this is detected by comparing fp 
with rfb). Figure 4-5 illustrates stack underflow. In this case, the non-resident portion of 
the cailer’s stack must be moved from data memory to the local registers. Underflow 
occurs because overflow occurred at some previous point during program execution, 
causing part of the Register Stack to be moved to memory. 


The processor performs no hardware management of the stack cache and cannot detect 
a reference to a quantity that is not in the stack cache. Consequently, software must 
keep the size of an activation record less than or equal to the size of the local register 
file (128 words). Any additional storage requirements are satisfied by the Memory Stack. 


Memory Stack 


In general, the Memory Stack is used to augment the Register Stack, holding additional 
information associated with activation records. For example, the Memory Stack holds 
large data structures that cannot fit into the Register Stack. Similar to the Register Stack, 
the Memory Stack contains a series of (possibly overlapping) activation records, each 
corresponding to a procedure activation. However, a Memory Stack activation record 
need not exist for a procedure that does not need a Memory Stack Area. The Memory 
Stack contains the following information: 


= Overflow incoming arguments. These are incoming arguments that do not fit in the 
allowed incoming arguments area of the Register Stack activation record. 
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B Spilled incoming arguments. These are incoming arguments that cannot be kept in 
the Register Stack. For example, if the address of an argument is used in a called 
procedure, the associated value must be in the Memory Stack. 


m Any procedure-local variable not allocated to a register. 


m Local block space. This storage is allocated dynamically on the Memory Stack. 
It is used to implement functions such as the alloca() function in the C programming 
language. 


m Overflow outgoing arguments. These are outgoing arguments that do not fit in the 
allowed outgoing arguments area of the Register Stack activation record. 


In contrast to the Register Stack, the Memory Stack is not cached and has no fixed size 
limit. The top of the Memory Stack is defined by the memory stack pointer (msp), which 
is stored in Global Register 125 by convention. 


PROCEDURE LINKAGE CONVENTIONS 


The procedure linkage conventions define the standard sequences of instructions used 
to call and return from procedures. These instruction sequences perform the following 
operations (other, more general operations may also be required, as described later): 


m= Put procedure arguments into the outgoing arguments area of the activation record. 
This may or may not involve copying the arguments; copying is not necessary if the 
arguments are placed into the appropriate registers as the result of computation. 
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= Branch to the procedure using a call instruction, which also paces the return address 
in a register. 


w Allocate a frame on the Register Stack. A frame is the storage that contains the pro- 
cedure’s activation record. 


mw |f overflow occurs during frame allocation, spill the least recently used locations of the 
Register Stack. The number of spilled locations must be sufficient to allow the new 
frame to reside entirely within the local registers. 


m Determine the frame-pointer value of the called procedure, if this procedure may call 
another procedure. 


Execute the procedure. 
Place return values into the appropriate registers. 
Deallocate the activation-record frame. 


Fill locations of the local registers from the Register Stack in external memory, if un- 
derflow occurs. 


m@ Branch to the procedure’s return address. 


This section describes the routines that implement the procedure linkage conventions. 
The operations described here are not required on every procedure call. In some cases, 
operations can be omitted or simpler routines used; these cases and the accompanying 
simplifications are also described here. 


Argument Passing 


The linkage convention allows up to 16 words of arguments to be passed from the caller 
to the callee in local registers. These arguments are passed in Local Register 2 through 
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Local Register 17 of the caller (note that the local-register numbers are different for the 
caller and the callee, because of Stack-Pointer addressing). 


When more than 16 words are required to pass arguments, the additional words are 
passed on the Memory Stack. In this case, the memory stack pointer (in Global Register 
125) points to the seventeenth word of the arguments, and the remaining argument 
words have higher memory addresses. Multiword arguments may be split across the 
Register Stack and the Memory Stack. For example, if a multiword argument starts on 
the sixteenth word of the outgoing arguments, the first word of the argument is passed in 
the Register Stack, and the remainder of the argument is passed in the Memory Stack. 


All arguments occupy at least one word. Arguments that are a byte or half-word in length 
(for example, a character) are padded to 32 bits and passed as a full word. However, an 
array or structure composed of multiple byte or half-word components can be passed as 
a single, packed array or structure of bytes or half-words rather than an array or 
structure of padded bytes or half-words. 


No argument is aligned to anything other than a word address boundary, including 
multiword arguments. Some multiword arguments are referenced as a single object (for 
example, double-precision floating-point values). It may be necessary to copy such 
arguments to an aligned memory or register area before use. 


Procedure Prologue 


When a procedure is called and the procedure may call another procedure, the callee 
must allocate a frame for itself on the Register Stack (this is not required for leaf 
procedures that do not call other procedures, as described later). A frame is allocated by 
decrementing the register stack pointer to accommodate the size of the required 
activation record. The procedure prologue is the instruction sequence that allocates the 
callee’s Register Stack frame. 


To allocate the stack frame, the prologue routine decrements the register stack pointer 
by the amount rsize (see Figure 4-6). The value of rsize must be an even number given 
by the following formula: 


rsize = (size of local variable area) + (size of outgoing arguments area) + 2 


The value 2 in this formula accounts for the space required by the return address (in 
Local Register 0) and the frame pointer (in Local Register 1). The size of the local 
variable area includes the space for the memory frame pointer, if required. If the formula 
total is an odd value, the total must be adjusted (by adding 1) so the resulting rsize value 
is even. This aligns the top of the Register Stack on a double-word boundary. The 
reason for this alignment is that double-precision floating-point values must be aligned to 
registers with even absolute-register numbers. Alignment of double-precision values is 
accomplished by placing these values into even-numbered local registers and making 
rsize even (it is also assumed that the register stack pointer is initialized on an even- 
word boundary). 


Resize is not the size of the entire activation record of the callee, because the callee’s 
activation record includes storage that was allocated as part of the caller’s activation 
record frame (e.g., the caller’s outgoing arguments area, which is the callee’s incoming 
arguments area). The size of the callee’s entire activation record is denoted size and is 
given by the following formula: 


size= rsize + (size of the incoming arguments area) + 2 


Procedure Linkage 


AMD Ll 


Figure 4-6 Definition of size and rsize Values 
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In the prologue routine, the following instruction is used to allocate the stack frame 
(rsp = gr1): | 


prologue: : 
sub rsp,rsp,rsize*4 ; *4 converts words to bytes 


However, this instruction does not account for the fact that there may not be enough 
room in the local registers to contain the activation record. There must be additional 
instructions to detect stack overflow and to cause spilling if overflow occurs. This is 
accomplished by comparing the new value of the register stack pointer with the value of 
the register allocate bound and invoking a trap handler (with vector number V_SPILL) if 
overflow is detected. | 


Furthermore, if the procedure calls another procedure, the prologue must compute a 
frame pointer. The frame pointer will be used by procedures called in turn by the callee 
to insure that the callee’s activation record is in the local registers upon return (i.e., that it 
has not been spilled onto the Register Stack in data memory). The frame pointer is 
computed in the prologue because it need only be computed once, regardless of how 
many procedures are called by a given procedure. 


The complete procedure prologue is then (fp = Ir1): 


prologue: 
sub __srsp, rsp, rsize*4 ; allocate frame 
asgeu V_SPILL, rsp, rab ; Call spill handler if needed 
add fp, rsp, size*4 ; compute frame pointer 
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Spill Handler 


lf overflow occurs, the assert instruction in the prologue fails, causing a trap. The trap 
handler invokes a User-mode routine in the trapping process to spill Register Stack 
locations from the local registers to external memory. Having most of the spill handling in 
a User-mode routine minimizes the amount of time that interrupts are disabled and 
insures that spilling is performed using the correct virtual-memory configuration. 


The spill handler uses two registers. The first register, Global Register 121, normally 
contains a trap handler argument (tav), but is used by the spill handler as a temporary 
register. The second register, Global Register 122, stores a trap handler return address 
(tpc). This register is used by the User-mode spill handler to return to the trapping 
procedure. It is assumed that the address of the User-mode spill handler is contained in 
a global register, denoted user_spill_reg in the following instruction sequence. 


The complete spill handler is: 


Spill: ; operating-system routine 
mfsr tpc, PC1 ; save return address 
mtsr PC1, user_spill_reg ; branch to User spill via interrupt return 
add tav, user_spill_reg, 4 
mtsr PCO, tav 
iret 
user_spill: ; User-mode spill handler 
sub tav, rab, rsp ; compute spill: allocate bound — rsp 
sri tav, tav, 2 ; shift to get number of words 
sub tav, tav, 1 ; count is one less 
mtsr CR, tav ; set Count Remaining Register 
sub tav, rab, rsp 
sub tav, rib, tav ; compute new free bound 
add rab, rsp, 0 ; adjust allocate bound 
storem 0, 0, Ir0, tav ; Spill 
jmpi tpc ; return to trapping procedure 
add rfb, tav, O ; adjust free bound 


Return Values 


If the called procedure returns one or more results, the first 16 words of the result(s) | 
are returned in Global Register 96 through Global Register 111, starting with Global 
Register 96. 


If more than 16 words are required for the results, the additional words are returned in 
memory locations allocated by the caller. In this case a large return pointer (/rp) provided 
by the caller in Global Register 123 at the time of the call points to the seventeenth word 
of the results, and subsequent words are stored at higher memory addresses. 


Procedure Epilogue 


The procedure epilogue deallocates the stack frame allocated by the procedure 
prologue and returns to the calling procedure. Stack deallocation is accomplished by 
adding the rsize value back to the register stack pointer, after which the deallocated 
registers are no longer used and are considered invalid. The epilogue also detects stack 
underflow and causes register filling if underflow occurs. This is accomplished by 
comparing the value of the caller’s frame pointer with the register free bound and 
invoking a trap handler (with vector number V_FILL) if underflow is detected. Finally, the 
epilogue returns to the caller using the caller’s return address. 
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The complete procedure epilogue is: 


epilogue: 
add rsp, rsp, rsize*4 > add back rsize count 
nop - cannot reference a local register here 
asleu =~ V_FILL, fp, rfb ; call fill handler if needed 
jmpi [rd ; Jump to return address 
nop ; delay slot — 


Fill Handlers 


If underflow occurs, the assert instruction in the epilogue fails, causing a trap. The trap 
handler invokes a User-mode routine in the trapping process to fill Register Stack 
locations from the external memory to local registers. The fill handler is similar in 
organization to the spill handler discussed above. 


The complete fill handler is: 


Fill: ; operating-system routine 
: mfsr tpc, PC1 ; save return address 
mtsr PC1, user_fill_reg ; branch to User fill via interrupt return 
add tav, user_fill_reg, 4 
mtsr PCO, tav 
iret 
user_fill: ; User-mode fill handler 


const _ tav, (0x80<<2) local register has high bit set 


or tav, tav, rfb ; put starting register number into Indirect 
; Pointer A 

mtsr IPA, tav : 

sub tav, fp, rfb ; compute number of bytes to fill 

add rab, rab, tav adjust the allocate bound 

srl tav, tav, 2 change byte count to word count 

sub tav, tav, 1 make count zero-based 


mtsr CR, tav 
loadm OQ, 0, gr, rfb 
jmpi tpc 

add rfb, {r1, 0 


set Count Remaining register 
fill | 
return to trapping procedure 
adjust the free bound 


Register Stack Leaf Frame 


A leaf procedure is one that does not call any other procedure. The incoming arguments 
of a leaf procedure are already allocated in the calling procedure’s activation-record 
frame, and the leaf routine is not required to allocate locations for any outgoing argu- 
ments, frame pointer, or return address (since it performs no call). Hence, a leaf 
procedure need not allocate a stack frame in the local registers, and can avoid the 
overhead of the procedure prologue and epilogue routines. Instead, a leaf routine can 
use a Set of global registers for local variables; Global Register 96 through Global 
Register 124 are reserved for this purpose (among other purposes). If there is an 
insufficient number of global registers, the leaf procedure may allocate a frame on the 
Register Stack. | 


Local Variables and Memory-Stack Frames 


A called procedure can store its local variables and temporaries in space allocated in the 
Register Stack frame by the procedure prologue. The values are referenced as an offset 
from the rsp base address, using the Stack-Pointer addressing of the local registers. No 
object in a register is aligned on anything smaller than a register boundary, and all 
objects take at least one register. 
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Because there are 128 local registers, the total Register Stack activation-record size 
cannot be greater than 128 words. If the callee needs more space for local variables and 
temporaries, it must allocate a frame on the Memory Stack to hold these objects. To 
allocate a Memory-Stack frame, the procedure prologue decrements the memory stack 
pointer (msp, in gr125). The procedure epilogue deallocates the Memory-Stack frame by 
incrementing the msp. 


A procedure that extends the Memory Stack dynamically (e.g., using alloca()) must 
make a copy of the msp at procedure entry before allocating the Memory-Stack frame. 
The msp is stored in the memory frame pointer (mfp) entry of the activation record in the 
Register Stack. The procedure can then change the msp during execution, according to 
the needs of dynamic allocation. On procedure return, the Memory-Stack frame is 
deallocated using the mfp to restore the msp. A procedure that does not extend the 
Memory Stack dynamically need not have an mfp entry in its activation record. 


The following prologue and epilogue routines are used if there is no dynamic allocation 
of the Memory Stack during procedure execution, but aMemory Stack frame is other- 
wise required (Figure 4-6 contains a diagram of register usage): 


prologue: 

sub rsp, rsp, <rsize>*4 ; allocate register frame 

asgeu V_SPILL, rsp, rab ; Call spill handler if needed 

add fp, rsp, <size>*4 ; compute register frame pointer 

sub msp, msp, <msize> ; allocate memory frame 

; msize = size of memory frame in words 

epilogue: 

add rsp, rsp, <rsize>*4 ; deallocate register frame 

add msp, msp, <msize> ; deallocate memory frame 

jmpi lrO ; return 

asleu V_FILL, fp, rfb ; Call fill handler if needed 


The following prologue and epilogue routines are used if there is dynamic allocation of 
the Memory Stack during procedure execution: 


prologue: 

sub rsp, rsp, <rsize>*4 _ 5 allocate register frame 

asgeu V_SPILL, rsp, rab ; Call spill handler if needed 

add fp, rsp, <size>*4 ; compute register frame pointer 

add Ir{<rsize> — 1}, msp, O ; Save memory frame pointer 

| ; Ir{rsize—1} is last reg in new frame 
sub msp, msp, <msize> ; allocate memory frame, 
| ; msize = size of memory frame in words 

epilogue: 

add msp, Ir{<rsize> — 1},0 ; restore memory stack pointer 

| ; deallocate memory frame 

add rsp, rsp, <rsize>*4 ; deallocate register frame 

nop ; Cannot reference a local register here 

jmpi IrO ; return 

asleu _—*V_FILL, fp, rfb ; Call fill handler if needed 


Static Link Pointer 


Some programming languages permit nested procedure declarations, introducing the 
possibility that a procedure may reference variables and arguments that are defined and 
managed by another procedure. This other procedure is a static parent of the callee. A 
static parent is determined by the declarations of procedures in the program source and 
is not necessarily the calling procedure; the calling procedure is the dynamic parent. 
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Since procedures can be nested at a number of levels, a given procedure may have a 
number of hierarchically organized static parents. 


A called procedure can locate its dynamic parent and the variables of the dynamic 
parent because of the return address and frame pointer in the Register Stack. However, 
these are not adequate to locate variables of the static parent that may be referenced in 
the procedure. If such references appear in a procedure, the procedure must be 
provided with a static link pointer (s/p). In the run-time organization, the s/p is stored in 
Global Register 124. Since there can be a hierarchy of static parents, the sip points to 
the sip of the immediate parent, which in turn points to the s/p of its immediate parent, 
and so on. Note that the contents of Global Register 124 may be destroyed by a 
procedure call, so a procedure needing to reference the variables of a static parent may 
need to preserve the sip until these references are no longer necessary. 


Transparent Procedures 


A transparent procedure is one that requires very little overhead for managing run-time 
storage. Transparent procedures are used primarily to implement compiler-specific 
support functions, such as integer divide. 


A transparent routine does not allocate any activation-record frames. Parameters are 
passed to a transparent procedure using tav and the Indirect Pointer A, B, and C 
registers. The return address is stored in toc. This convention allows a leaf procedure to 
call a transparent procedure without changing its status as a leaf procedure. There isa 


- tight relationship between a compiler and the transparent procedures it calls. Some 


transparent procedures may need more temporary registers and the compiler must 
account for this. 


REGISTER USAGE CONVENTION 

The run-time organization standardizes the uses of the local and global registers. This 
section summarizes register use and the nomenclature for register values: 

a GR1: Register stack pointer (rsp) 

=m GR2-GR63: Unimplemented 

m GR64-—GR95: Reserved for operating-system use 

- 


GR96-GR111: Procedure return values. Lower-numbered registers are used before 
higher-numbered registers. If more than 16 words are needed, the additional words 
are stored in the Memory Stack (see GR123, large return pointer). These registers 
are also used for temporary values that are destroyed upon a procedure call. 


@ GR112-GR115: Reserved for programmer. These registers are not used by the com- 
piler, except as directed by the programmer. 


m GR116-GR120: Compiler temporaries 


m GR121: Trap handler argument/temporary (tav)—This register is used to communi- 
cate arguments to a software-invoked trap routine. It can be destroyed by the trap, | 
but not by other traps and interrupts not explicitly setae by the program (for 
example, a Timer trap). 


m@ GR122: Trap handler return address/temporary (fpc). This register is also used by 
software-invoked traps. It can be destroyed by the trap, but not by other traps and 
interrupts not explicitly generated by the program (for example, a Timer trap). - 


mw GR123: Large return pointer/temporary (/rp) 
m GR124: Static link pointer/temporary (sip) 
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~GR125: Memory stack pointer (msp) 
GR126: Register allocate bound (rab) 
GR127: Register free bound (rfb) 


LRO: Return address 
LR1: Frame pointer (fp) 


In this convention, registers must be handled by software according to system require- 
ments. The following practices are recommended: 


m GR64—GR95 should be protected from User-mode access by the Register Bank 
Protect Register. 


m= The contents of GR96—GR124 should be assumed destroyed by a procedure call, 
~ unless the procedure is a transparent procedure. 


= The contents of GR121 and GR122 should be assumed destroyed by any procedure 
call or any program-generated trap. 


m= The contents of GR125 are always preserved by a procedure call. 


@ The contents of GR126 and GR127 are managed by the spill and fill handlers and 
should not be modified except by these handlers. 


COMPLEX PROCEDURE CALL EXAMPLE 


The following code sequence demonstrates a complex procedure call, illustrating how 
registers are used in the run-time organization: 


caller: 
(other code) 
add Irp, msp, 32 ; pass Irp 
add slp, msp, 120 ; pass a static link 
call lrO, callee 
const [r2, 1 ; 1 as first argument 
(other code) 
Callee: 
const __ tav, (126—2)*4 ; maximum register allocation 
sub rsp, rsp, tav ; allocate register frame 
asgeu. V_SPILL, rsp, rab ; assert will be taken 
const __ tav, (126—2)*4 + (3*4) ; incoming arguments and overhead 
add fp, rsp, tav ; create frame pointer 
add Iri23, msp, O ; for dynamic Memory-Stack allocation 
const __tav, memory_frame_size _; big msize (>65535 bytes) 
consth tav, memory_frame_size _; high half of msize 
sub msp, msp, tav ; allocate memory frame 
add Ir18, Irp, 0 ; save Irp for later 
add Ir19, slp, O ; save slp for later 
(other code) | 
add msp, Ir123, O ; deallocate memory frame 
const _ tav, (126-2)*4 _; maximum allocation size 
add rsp, rsp, tav ; deallocate register frame 
const 196, 1 ; return value 
jmpi Ird ; return to caller | 
asleu ; ensure caller’s registers in frame 


V_FILL, fp, rfb 
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TRACE-BACK TAGS 


A trace-back tag is either one or two words of information included at the beginning of 
every procedure. This information permits a debug routine to determine the sequence of 
procedure calls and the values of program variables at a given point in execution. The 
trace-back tag describes the memory frame size and the number of local registers used 
by the associated procedure. A one-word tag is used if the memory frame size is less 
than 2K words; otherwise, the two-word tag is used. Regardless of tag length, the tag 
directly precedes the first instruction of the procedure. Figure 4-7 shows the format of the 
trace-back tags. 


Trace-Back Tags 


One-word tag: 


31 23 15 | 7 ) 


Two-word tag: 
31 23 15 7 0 








The first word of a trace-back tag starts with the invalid operation code 00 (hexadecimal). 
This unique, invalid instruction operation code allows the debugger to locate the beginning 
of the procedure in the absence of other information related to the beginning of the 
procedure, such as from a symbol table. This is particularly useful after a program crash, 
in which case the debug routine may have only an arbitrary instruction address within a 
procedure. The call sequence up to the current point in execution can be determined from 
the argcount and msize values in the trace-back tag. However, for procedures that perform 
dynamic stack allocation (e.g., using alloca()), the memory frame pointer must be used. 


The tag word immediately preceding a procedure contains the following fields. Reserved 
fields must be zero. , 


1-Word 2-Word 


Tag Bits Tag Bits — Item Description 
31-24 31-24 (word 2) opcode 00h (an invalid opcode) 
23 23.(word 2) tagtype =  O=one-word tag; 1=two-word tag 
22 22 ad 2). f O=no mfp; 1=mfp used 
21 21 (word 2) Transparent 0=normal;1=transparent procedure 
20-16 20-16 (word 2) argcount Number of arguments in 

registers (including IrO and Ir1) 
15-11 15-0 eee 2 reserved Reserved, must be zero 
10-3 31-2 (word 1 msize Memory frame size in doublewords 
2-0 1-0 (word 1) - reserved Reserved, must be zero 


If the procedure uses a Memory-Stack frame size 2K words or more, the msize field is 
contained in the second tag word immediately preceding the first tag word. 
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This chapter offers a general overview of the internal operation of the pipeline, to help 
the programmer understand how the pipeline affects the program execution and the 
microcontroller's behavior under certain conditions. 


The operation of the functional units is coordinated by Pipeline Hold mode, which 
ensures that operations are performed in the proper order. In certain cases, the pipeline 
is exposed during instruction execution, because execution of certain instructions is 
dependent on the execution of previous instructions. This chapter discusses the cases 
where the pipeline is exposed to software and describes the resulting effect on instruc- 
tion execution. 


5.1 FOUR-STAGE PIPELINE 


The Am29200 and Am29205 microcontrollers implement a four-stage pipeline for 
instruction execution. The four stages are fetch, decode, execute, and write-back. For 
operations, the pipeline is organized so the effective instruction-execution rate may be 
as high as one instruction per cycle. 


During the fetch stage, the instruction fetch unit determines the location of the next 
processor instruction and issues the instruction to the decode stage. The instruction is 
fetched from an external instruction memory. 


During the decode stage, the instruction issued from the fetch stage is decoded, and the 
required operands are fetched and/or assembled. Addresses for branches, loads, and 
stores are also evaluated. 


During the execute stage, the execution unit performs the operation specified by the 
instruction. 


During the write-back stage, the results of the operation performed during the execute 
stage are stored. In the case of branches, loads, and stores, an address is transmitted to 
a memory or a peripheral. 


Most pipeline dependencies internal to the processor are handled by forwarding logic in 
the processor. For those dependencies that result from the external system, the Pipeline 
Hold mode ensures proper operation. 


In a few special cases, the processor pipeline is exponee to software executing on the 
microcontroller (see Sections 5.4, 5.5, and 5.6). 


5.2 PIPELINE HOLD MODE 


The Pipeline Hold mode is activated whenever sequential processor operation cannot be 
guaranteed. When this mode is active, the pipeline stages do not advance, and most 
internal processor state is not modified. 


The processor places itself in the Pipeline Hold mode in the following situations: 


m The processor requires an instruction that has either not been fetched or not been 
returned by the external instruction memory. 
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m The processor requires data from an in-progress load and the operation has not 
completed. 


m The processor attempts to execute a load or store instruction while another load or 
store is in progress. 


m The processor must perform a serialization operation as described in Section 5.3. 


m™ The processor is performing a sequence of load-multiple or store-multiple accesses. 
The Pipeline Hold mode in this case prevents further instruction execution until the 
completion of the load-multiple or store-multiple sequence. 


m The processor has taken an interrupt or trap, and the first instruction of the interrupt 
or trap handler has not entered the execute stage. The Pipeline Hold mode in this 
case prevents the processor pipeline from advancing until the interrupt or trap handler 
can begin execution. 


m The processor has executed an interrupt return, and the target instruction of the inter- 
rupt return has not entered the execute stage. The Pipeline Hold mode in this case 
prevents the processor pipeline from advancing until the interrupt return sequence is. 
complete. : 


The Pipeline Hold mode is exited whenever the causing conditions no longer exist, or 
when the WARN or RESET input is asserted. 








SERIALIZATION 


The Am29200 and Am29205 microcontrollers overlap external data references with 
other operations. When an external data reference might have to be restarted, however, 
the processor context must be the same as when the operation was first attempted. To 
insure this, certain operations are serialized. 


The processor serializes by entering the Pipeline Hold mode in any of the following 
circumstances: 


m™ An external access is not yet completed, and one of the following instructions is en- 
countered: 
Move to Special Register (MTSR) - 
Move to Special Register Immediate (MTSRIM) 
Move to TLB (MTTLB)—even though this performs no operation 
Interrupt Return (IRET) 
Interrupt Return and Invalidate (IRETINV) 
Halt (HALT) 


m An external access is not yet completed, and an interrupt or trap, other than a WARN 
trap, is taken. , 


If the processor is in the Pipeline Hold mode due to serialization, it enters the Executing 
mode once the external access is completed. 


DELAYED BRANCH 


The effect of jump and call instructions is delayed by one cycle to allow the processor 
pipeline to achieve maximum throughput. When one of these branches is successful, the 
instruction immediately following the jump or call is executed before the target instruction 
of the jump or call is executed. Jump and call instructions collectively are referred to as 
delayed branches, and the instruction immediately following is called the delay instruc- 
tion (sometimes referred to as a delay slot). 
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For example, in the following code fragment: 


cpeq grQ6, Ir6, Ir7 (1) 
jmpf gr96, label (2) 
sub Ir6, Ir6, 1 (3) 
const 16,0 — (4) 
label: call lrO, sort (5) 
add | Ir2, Ir5, 0 (6) 


cpneq Ir3, gr96, 0 (7) 


The SUB instruction (3) is executed regardless of the outcome of the JMPF instruction 
(2). Of course, if the JMPF is not successful, the CONST instruction (4) is also executed. 
If the JMPF is successful, then the instruction sequence is: (2), (3), (5), (6), and then the 
_ first instruction of the sort procedure. Note that the CALL instruction (5) is also a delayed 
branch, so the instruction immediately following it, (6), is always executed. After the sort 
procedure executes the return sequence, the CPNEQ instruction (7) is the next instruc- 
tion executed. 


The benefit of delayed branches is improved performance and a simplified processor 
implementation. Performance is improved because the processor pipeline executes 
useful instructions in a larger number of cycles, compared to an implementation without 
delayed branches. | . 


For example, ignoring all other effects on performance and assuming 15% of all 
instructions are taken branches, then a processor without delayed branches would take 
at least two cycles for 15% of its instructions, leading to 0.85(1) + 0.15(2) = 1.15 cycles 
per instruction, on average. This represents a 15% performance degradation compared 
to a processor with delayed branches (assuming, for this simple example, the delay 
instruction is always useful). 


The cost of having delayed branches is either the extra effort required when the compiler 
takes advantage of delayed branches (by re-organizing code), or the extra NO-OP 
instruction that the compiler inserts after every branch to guarantee correct program | 
operation. Since the compiler expends only a small amount of effort to avoid wasting 
time and space with NO-OPs, and since the performance improvement resulting from 
this effort is significant, delayed branches are beneficial overall. 


When two immediately adjacent branches are taken, the target of the first branch 
pre-empts execution of the delay cycle of the second branch, and the target of the 
second branch then follows the target of the first branch. For example, in the following 
code fragment: 


‘jmp L1 : (1) 
jmp L2 (2) 
add Ir4, Ir4, 15 (3) 
Li: sub gr96, gr96, 1 (4) 


subc - gr97, gr97, 0 (5) 
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L2: const gr100, OxffOf (6) 
subr gr101, gr101, 1 (7) 


or | gr100, gri00, gri01 = (8) 


an unconditional JMP instruction (1) is followed immediately by another unconditional 


JMP instruction (2). (In this example, unconditional JMPs are used; however, any two 


immediately adjacent taken branches exhibit the same behavior.) The sequence of 
executed instructions in this case is: JMP instruction (1), JMP instruction (2), SUB 
instruction (4), CONST instruction (6), SUBR instruction (7), OR instruction (8), and so 
on. Note that the ADD instruction (3) is not executed. Also, the target of the first JMP 
instruction (1) was merely visited; control did not continue sequentially from L1, but 
rather continued from L2. 


OVERLAPPED LOADS AND STORES 

The Am29200 and Am29205 microcontrollers overlap external data references with 
other operations. Certain programming practices are necessary to exploit this parallelism 
to improve program performance. 


In order to make full use of overlapped storage accesses, some instruction reorganiza- 
tion may be necessary. For example, in the following sequence: 


loop: 
sll gr121, gr119, 2 (1) 
add gr121, gr120, gr121 = (2) 
load 0, 0, gr121, gri21 (3) 
add gr96, gr96, gri21 (4) 
sub gr98, gr98, 3 (5) 
add gri19, gr119, 1 (6) 
cplt | gri22, gr119, Ir2 (7) 
jmpt gr122, loop (8) 


nop | (9) 


the ADD instruction (4) uses the result of the LOAD instruction (3). However, the 
following four instructions do not depend on the result of the LOAD. Therefore, the ADD 
instruction (4) can be moved past the JMPT (8), since it always will be executed even 
ifthe JMPT is taken, and can replace the NO-OP instruction (9). The resulting sequence 
is: 


loop: 
sll gr121, gr119, 2 (1) 
add | gri2i1, gr120, gri21 = (2) 
load 0, 0, gr121, gr121 (3) 
sub gr98, 998, 3 (4) 
add gr119, gr119, 1 (5) 
cplt gr122, gr119, Ir2 (6) 
jmpt gr122, loop (7) 


add gr96, gr96, gr121 (8) 


The instructions (4) through (7) are likely to be executed while external memory satisfies 
the load request, resulting in improved throughput. The processor thus allows parallel- 
ism to be exploited by instruction reordering. | 
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The overlapped load feature may be used to improve processor performance, but 


imposes no constraints on instruction sequences, as delayed branches do. The proces- 
sor implements the proper pipeline interlocks to make this parallelism transparent to a 
running program. 


DELAYED EFFECTS OF REGISTERS 


The modification of some registers has a delayed effect on processor behavior, because 
of the processor pipeline. The affected registers are the Stack Pointer (Global Register 
1), Indirect Pointers A, B, and C, and the Current Processor Status Register. 


An instruction that writes to the Stack Pointer can be followed immediately by an 
instruction that reads the Stack Pointer. However, any instruction that references a local 
register also uses the value of the Stack Pointer to calculate an absolute-register 
number. At least one cycle of delay must separate an instruction that updates the Stack 
Pointer and an instruction that references a local register. In most systems, this affects 
procedure call and return only (see Section 4.2). In general, though, an instruction that 
immediately follows a change to the Stack Pointer should not reference a local register 
(however, note that this restriction does not apply to a reference of a local register via an 
indirect pointer). 


The indirect pointers have an implementation similar to the Stack Pointer and exhibit 
similar behavior. At least one cycle of delay must separate an instruction that modifies 
an indirect pointer and an instruction that uses that indirect pointer to access a register. 


Note that it normally is not possible to guarantee that the delayed effect of the Stack 
Pointer and indirect pointers is visible to a program. If an interrupt or trap is taken 
immediately after one of these registers is set, then the interrupted routine sees the 
effect of the setting in the following instruction, because many interrupt or trap execution 
cycles elapse between the two instructions of the interrupted routine. For this reason, a 
program should not be written in a manner that relies on the delayed effect; the results of 
this practice may be unpredictable. 


If the Freeze (FZ) bit of the Current Processor Status Register is reset from 1 to 0, two 
cycles are required before all program state is reflected properly in the registers affected 
by the FZ bit. This implies that interrupts and traps cannot be enabled until two cycles 
after the FZ bit is reset, for proper sequencing of program state. There is no delay 
associated with setting the FZ bit from 0 to 1. 
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G SYSTEM PROTECTION 


The Am29200 and Am29205 microcontrollers provide protection for general-purpose 
registers and special-purpose registers. Certain processor operations are also protected. 
This chapter describes the processor’s protection mechanisms. 


6.1 USER AND SUPERVISOR MODES 


At any given time, the microcontroller operates in one of two mutually exclusive program 
modes: the Supervisor mode or the User mode. All system-protection features of the 
microcontroller are based on the difference between these two modes. 


6.1.1 Supervisor Mode 


The processor operates in the Supervisor mode whenever the Supervisor Mode (SM) bit 
of the Current Processor Status Register is 1 (see Section 16.2.1). In the Supervisor 
mode, executing programs have access to all processor resources. 


Any attempt to access a special-purpose register in the range of 160 to 255 causes a 
Protection Violation to occur in either Supervisor or User mode. This permits virtualiza- 
tion of these registers. Supervisor-mode accesses are permitted for any general-pur- 
pose register, regardless of protection. 


6.1.2 User Mode 


The processor operates in the User mode whenever the SM bit in the Current Processor 
Status Register is 0. In the User mode, any of the following actions by an executing 
program causes a Protection Violation trap to occur: 


m An attempted access of any general-purpose register for which a bit in the Register 
Bank Protect Register is 1 (see Section 6.2). 


= An attempted execution of one of the following instructions: Interrupt Return, Interrupt 
Return and Invalidate, Invalidate, or Halt. However, a hardware-development system 
can disable protection checking for the Halt instruction, so this instruction may be 
used to implement instruction breakpoints in User-mode programs (see Sections 17.3 
and 17.7.5). 


m An attempted access of special-purpose register in the range of 0 to 127 or 
160 to 255. 


m An attempted execution of an. assert or EMULATE instruction that specifies a vector 
number between 0 and 63, inclusive (see Section 16.3.2). 


6.2 REGISTER PROTECTION 


General-purpose registers are divided into register banks and are protected by the 
Register Bank Protect Register. The Register Bank Protect Register allows parameters 
for the operating system to be kept in general-purpose registers and protected from 
corruption by User-mode programs. Register banks consist of 16 registers (except for 
Bank 0, which contains Registers 2 through 15) and are parutioned according to 
absolute- “register numbers, as shown in Figure 6-1. 
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Figure 6-1 Register Bank Organization 


Register Bank Absolute-Register General-Purpose 
Protect Register Bit Numbers Registers 


2 through 15 Bank 0 (not implemented) 


i=) 


16 through 31 Bank 1 (not implemented) 
32 through 47 Bank 2 (not implemented) 
48 through 63 Bank 3 (not implemented) 
64 through 79 Bank 4 

80 through 95 Bank 5 

96 through 111 Bank 6 

112 through 127 Bank 7 

128 through 143 Bank 8 

144 through 159 Bank 9 

160 through 175 Bank 10 

176 through 191 Bank 11 

192 through 207 Bank 12 

208 through 223 Bank 13 

224 through 239 Bank 14 

240 through 255 Bank 15 


1 
2 
3 
4 
5 
6 
7 
8 
9 
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The Register Bank Protect Register contains 16 protection bits, where each bit controls 
User-mode accesses (read or write) to a bank of registers. Bits O—-15 of the Register 
Bank Protect Register, protect Register Banks 0 through 15, respectively. 


When a bit in the Register Bank Protect Register is 1 and a register in the corresponding 
bank is specified as an operand register or result register by a User-mode instruction, a 
Protection Violation trap occurs. Note that protection is based on absolute-register 
numbers. In the case of local registers, Stack-Pointer addition is performed before 
protection checking. 


When the processor is in the Supervisor mode, the Register Bank Protect Register has 
no effect on general-purpose register accesses. 


6.2.1 Register Bank Protect Register (RBP, Register 7) 
This protected special-purpose register (Figure 6-2) protects banks of general-purpose 
registers from User-mode program accesses. 


The general-purpose registers are partitioned into 16 banks of 16 registers each (except 
that Bank 0 contains 14 registers). The banks are organized as shown in Figure 6-1. 
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Register Bank Protect Register 
31 . 23 





Bits 31-16: Reserved _ 


Bits 15—0: Bank 15 through Bank 0 Protection Bits (B15—B0)—In the Register Bank 
Protect Register, each bit is associated with a particular bank of registers, and the 

bit number gives the associated bank number (e.g., B11 determines the protection for 
Bank 11). 
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The Am29200 and Am29205 microcontrollers significantly reduce system cost because 
each microcontroller integrates many system functions onto a single chip. This chapter 

overviews the system interfaces and on-chip peripherals of the Am29200 and Am29205 
microcontrollers. 


7.1 SIGNAL DESCRIPTION 


The Am29200 microcontroller uses 140 pins for signal inputs and outputs. It uses 28 | 
pins for power and ground. 


The Am29205 microcontroller uses 84 pins for signal inputs and outputs. It uses 16 pins 
for power and ground. Section 7.1.12 summarizes the signal differences between the 
Am29200 and Am29205 microcontrollers. 


Note: The UCLK signal must be tied High if the serial port is not used. The TRST signal 
must be tied to RESET, whether or not the JTAG port is used. See Appendix A for other 
important hardware configuration notes. 





7.1.1 Clocks a | 
INCLK Input Clock (input) 
_ This ts an oscillator input at twice the processor and system operating © 
frequency. It can be driven at TTL levels. 


MEMCLK Memory Clock (output) 
This is a clock output at one-half of the frequency of INCLK. Most proces- 
sor outputs, and many inputs, are synchronous to MEMCLK. MEMCLK 
drives out with CMOS levels. - 


7.1.2 Processor Signals 


A23—-A0 Address Bus (output, synchronous) 
The address bus supplies the byte address for all accesses, except for 
DRAM accesses. For DRAM accesses, multiplexed row and column 
addresses are provided on A14—A1. A2—A0 are also used to provide a 
clock to an optional burst-mode EPROM. The signals A23—A22 and 
burst-mode devices are not supported on the Am29205 microcontroller. 


ID31-IDO Instruction/Data Bus (bidirectional, synchronous) 
The instruction/data bus (ID bus) transfers instructions to, and data to and 
from the processor. The signals 1D15-IDO are not supported on the 
Am29205 microcontroller. 


WAIT Add Wait States 
(input, synchronous, weak internal pull-up transistor) 
External accesses are normally timed by the Am29200 microcontroller. 
However, the WAIT signal may be asserted during a PIA, ROM, or DMA 
access to extend the access indefinitely. The WAIT pin is not available on 
the Am29205 microcontroller; see the WAIT/TRIST signal description. 
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WAIT/TRIST Add Wait States/Three-State Control! 








(input, synchronous, weak internal pull-up transistor) 

The WAIT signal may be asserted during a PIA, ROM, or DMA access to 
extend the access indefinitely. The WAIT/TRIST pin also used for three- 
state control during test. When asserted during a processor reset, all 
output pins go into a high impedance state. For normal operation, this pin ~ 
must be pulled High during processor reset. This pin is not available on 
the Am29200 microcontroller; see the WAIT signal description. 


Read/Write (output, synchronous) 
During an external ROM, DRAM, DMA, or PIA access, this signal indi- 














cates the direction of transfer: High for a read and Low for a write. 


Reset (input, asynchronous) 

This input places the processor in the Reset mode. This signal has spe- 
cial hardening against metastable states, allowing it to be driven witha 
slow-rise-time signal. 


Warn (input, asynchronous, edge-sensitive, internal pullctp) 

A High-to-Low transition on this input causes a non-maskable WARN trap 
to occur. This trap bypasses the normal trap vector fetch sequence, and 
is useful in situations where the vector fetch may not work (e.g., when 
data memory is faulty). This signal has special hardening against metast- 
able states, allowing it to be driven with a slow-transition-time signal. This 
signal is not supported on the Am29205 microcontroller. 


INTR3-INTRO Interrupt Requests 3-0 (input, asynchronous, internal pull-ups) 


STAT2-STATO 


These inputs generate prioritized interrupt requests. The interrupt caused 
by INTRO has the highest priority, and the interrupt caused by INTRS3 has» 
the lowest priority. The interrupt requests are masked in prioritized order 
by the Interrupt Mask field in the Current Processor Status Register and 
are disabled by the DA and DI bits of the Current Processor Status Regis- 
ter. These signals have special hardening against metastable states, 
allowing them to be driven with slow-transition-time signals. The 
INTR1I-INTRO signals are not supported on the Am29205 microcontroller. 








CPU Status (output, synchronous) 
These outputs indicate information about the processor or the current 
access for the purposes of hardware debug. They are encoded as follows: 


STAT2 STAT1 STATO Condition 


Halt or Step mode 

Interrupt/trap vector fetch ee: valid) 

Load Test Instruction mode, Halt/Freeze 
Branch target fetch (instruction valid) 
External data access (data valid) 

External instruction access (instruction valid) 
Internal peripheral access (data valid) 

Idle or data/instruction not valid 


—~i§ OO $$ +4 0 O 
=-—O=- 0-0 © 


Note that in all cases, a condition is reflected on the STAT pins on the 
second cycle following the condition. These signals are described in more 


. detail in Section 17.4. The STAT2—STATO er are not supported on 


the Am29205 microcontroller. 
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TRAP1—TRAPO . 

Trap Requests 1-0 (input, asynchronous, internal pull-ups) 

These inputs generate prioritized trap requests. The trap caused by 
TRAPO has the highest priority. These trap requests are disabled by the 
DA bit of the Current Processor Status Register. These signals have 
special hardening against metastable states, allowing them to be driven 
with slow-transition-time signals. These magne are not ie al on the 
Am29205 microcontroller. 


- 7.1.3 | ROM Interface 


ROMCS3—ROMCSO 
ROM Chip Selects, Banks 3-0 (output, synchronous) | 
A Low level on one of these signals selects the memory devices in the 
corresponding ROM bank. ROMCSS selects devices in ROM Bank 3, and 
so on. The timing and access parameters of each bank are individually 
programmable. ROMCSS is not supported on the Am29205 microcontroller. 


ROMOE ROM Output Enable (output, synchronous) 
This signal enables the selected ROM Bank to drive the ID bus. Iti is used 
_ to prevent bus contention when switching between different ROM banks 
or switching between a ROM bank and another device or DRAM bank. 


BURST Burst-Mode Access (output, synchronous) 
This signal is asserted to perform sequential accesses from a burst-mode 
device. This signal is not supported on the Am29205 microcontroller. 








RSWE ROM Space Write Enable (output, synchronous) 
This signal is used to write an alterable memory in a ROM bank (such as 
an SRAM or Flash EPROM). RSWE supports only writes of width equal to 
_or greater than the width of the memory, and the memory must be at least 
16 bits wide. The CASx signals, described in Section 7.1.4, serve as 
individual byte strobes for writes to the ROM space, if ROM byte writes 
are enabled. 


BOOTW Boot ROM Width (input, asynchronous) 
This input configures the width of ROM Bank 0, so the ROM can be ac- 
cessed before the ROM configuration has been set by the system initial- 
ization software. The BOOTW signal is sampled during and after a pro- 
cessor reset. If BOOTW is High before and after reset (tied High), the 
boot ROM is 32 bits wide. lf BOOTW is Low before and after reset (tied 
Low), the boot ROM is 16 bits wide. If BOOTW is Low before reset and 
High after reset (tied to RESET ), the boot ROM is 8 bits wide. This signal 
has special hardening against metastable states, allowing it to be driven 
with a slow-rise-time signal and permitting it to be tied to RESET. — 


This signal is not supported on the Am29205 microcontroller. ROM Bank 
O is set to 16 bits during a processor reset; this setting cannot be 
changed. 








7.1.4 DRAM Interface 


RAS3-RASO Row Address Strobe, Banks 3-0 (output, synchronous) 
A High-to-Low transition on one of these signals causes a DRAM in the 
corresponding bank to latch the row address and begin an access. RAS3 
starts an access in DRAM Bank 3, and so on. These signals aise are 
used in other special DRAM cycles. 
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CAS3-CASO Column Address Strobes, Byte 3-0 (output, synchronous) 


TR/IOE 


A High-to-Low transition on these signals causes the DRAM selected by 
RAS3-RASO to latch the column address and complete the access. To 
support byte and half-word writes, column address strobes are provided 
for individual DRAM bytes. CAS3 is the column address strobe for the 
DRAMs, in all banks, attached to ID31-ID24. CAS2 is for the DRAMs 
attached to ID23-ID16, and so on. These signals are also used in other 
special DRAM cycles. 


The CASx signals can be enabled to act as individual byte strobes for 
byte writes to the ROM space. In this configuration, ROM accesses do 
not conflict with DRAM accesses or refresh even though the CASx may 
be used by both the ROM and DRAM. Refresh is delayed during byte 
reads and writes to ROM space. 


The CAS1-CASO signals are not supported on the Am29205 
microcontroller. 


Write Enable (output, asnenrenousl 

This signal is used to write the selected DRAM bank. “Early write” cycles 
are used so the DRAM data inputs and outputs can be tied to the com- 
mon ID bus. 


Video DRAM Transfer/Output Enable (output, synchronous) 

This signal is used with video DRAMs to transfer data to the video shift 
register. It is also used as an output enable in normal video DRAM read 
cycles. This signal is not supported on the Am29205 microcontroller. 


Peripheral Interface peoeter (PIA) 
PIACS5-PIACSO0 








Peripheral Chip Selects, Regions 5-0 (output, synchronous) 

These signals are used to select individual peripheral devices. DMA 
Channel 0 may be programmed to use PIACSO during an external periph- 
eral access, and DMA Channel 1 may be programmed to use PIACS1. 
PIACS5—PIACS2 are not supported on the Am29205 microcontroller. 


Peripheral Output Enable (output, synchronous) 
This signal enables the selected peripheral device to drive the ID bus. 














Peripheral Write Enable (output, synchronous) 
This signal causes data on the ID bus to be written into the selected 
peripheral. 


DMA Controller 
DREQi—DREQO 


DMA Request, Channels 1-0 (input, asynchronous, internal pull-ups) 
These signals request an external transfer on DMA Channel 0 (DREQO) or 
DMA Channel 1 (DREQ1). These requests are individually programmable 
to be either level- or edge-sensitive for either polarity of level or edge. DMA 
transfers can occur to and from internal peripherals independent of these 
requests. DREQO is not supported on the Am29205 microcontroller. 


DACK1—DACKO 


DMA Acknowledge, Channels 1-0 (output, synchronous) 
These signals acknowledge an external transfer on DMA Channel 0 
(DREQO) or DMA Channel 1 (DREQ1). DMA transfers can occur to and 
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TDMA 








1/O Port 
P1015—PIO0 
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from internal peripherals independent of these acknowledgments. DACKO 


_ is not supported on the Am29205 microcontroller. 


Terminate DMA (input, synchronous) | 
This signal can be asserted during an external DMA transfer to terminate 
the transfer after the current access. This signal is not supported on the 
Am29205 microcontroller. 


External Memory Grant Request (input, synchronous, internal pull-up) 
This signal is used by an external device to request an access to the 
Am29200 microprocessor’s ROM or DRAM. To perform this access, the 
external device supplies an address to the Am29200 microcontroller’s 
ROM controller or DRAM controller. 


To support a hardware-development system, GREQ should be either tied 
High or held at a high-impedance state during a processor reset. 


This signal is not supported on the Am29205 microcontroller. 


External Memory Grant Acknowledge (output, synchronous) 

This signal indicates to an external device that ithas been granted an 
access to the Am29200 microcontroller’s ROM or DRAM, and that the 
device should provide an address. This signal is not supported on the 
Am29205 microcontroller. © 


Programmable Input/Output (input/output, asynchronous) 

These signals are available for direct software control and inspection. 
P1O015—PIO8 may be individually programmed to cause processor inter- 
rupts. These signals have special hardening against metastable states, 
allowing them to be driven with slow-transition-time signals. The signals 
PIO7—PIO0 are not supported on the Am29205 microcontroller. © 


Parallel Port 


PSTROBE 


PBUSY 


PACK 


PAUTOFD 


U 
m 


Uv 
m 


Parallel Port Strobe (input, asynchronous) 
This signal is used by the host to indicate that data is on the parallel port 
or to acknowledge a transfer from the microcontroller. 


Parallel Port Busy (output, synchronous) 
This indicates to the host that the parallel port is busy and cannot accept 
a data transfer. 


Parallel Port Acknowledge (output, synchronous) 
This signal is used by the microcontroller to acknowledge a transfer from 
the host or to indicate to the host that data has been placed on the port. 


Parallel Port Autofeed (input, asynchronous) 

This signal is used by the host to indicate how line feeds should be per- 
formed or is used to indicate that the host is busy and cannot accept a 
data transfer. 


Parallel Port Output Enable (output, synchronous) 
This signal enables an external data buffer pepntaining data from the host | 
to drive the ID bus. 


Parallel Port Write Enable (Sutput. synchronous) 
This signal writes a buffer with data on the ID bus. Then, the buffer drives 
data to the host. 
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7.1.9 


7.1.10 


7.1.11 


Serial Port 
UCLK 


_ TXD 


RXD 


DTR 


UART Clock (input) 

This is an oscillator input for generating the UART (serial port) clock. To 
generate the UART clock, the oscillator frequency may be divided by 
any amount up to 65,536. The UART clock operates at 16 times the 
serial port’s baud rate. As an option, UCLK may be driven with MEMCLK 
or INCLK. It can be driven with TTL levels. UCLK must be tied High if 
unused. 


Transmit Data (output, asynchronous) 
This output is used to transmit serial data. 


Receive Data (input, asynchronous) 
This input is used to receive serial data. 


Data Set Ready (output, synchronous) 

This indicates to the host that the serial port on the Am29200 microcon- 
troller is ready to transmit or receive data. This signal is not supported on 
the Am29205 microcontroller. 


Data Terminal Ready (input, asynchronous) 

This indicates to the Am29200 microcontroller that the host is ready to 
transmit or receive data. This Signa’! is not supported on the Am29205 
microcontroller. 


Video Interface 


VCLK 


VDAT 
LSYNC 


PSYNC 


Video Clock (input, asynchronous) 


This clock is used to synchronize the transfer of video data. As an option, 
VCLK may be driven with MEMCLK or INCLK. It can be driven with TTL 
levels. 


Video Data (input/output, synchronous to VCLK) 
This is serial data to or from the video device. 


Line Synchronization (input, asynchronous) 
This signal indicates the start of a raster line. 


Page Synchronization (input/output, asynchronous) 
This signal indicates the beginning of a raster page. 


JTAG 1149.1 Boundary Scan Interface (Am29200 Microcontroller) 


TCK 


TMS 


TDI 


Test Clock Input (asynchronous input, internal pull-up) 

This input is used to operate the test access port. The state of the test 
access port must be held if this clock is held either High or Low. This 
clock is internally synchronized to MEMCLK for certain operations of the 
test access port controller, so signals internally driven and sampled by the 
test access port are synchronous to processor internal clocks. This signal 
is not available on the Am29205 microcontroller. 


Test Mode Select (input, synchronous to TCK, internal pull-up) 
This input is used to control the test access port. If it is not driven, it 
appears High internally. This signal is not available on the Am29205 
microcontroller. 


Test Data Input (input, synchronous to TCK, internal pull-up) 
This input supplies data to the test logic from an external source. It is 
sampled on the rising edge of TCK. If it is not driven, it appears High 
internally. This signal is not available on the Am29205 microcontroller. 
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TDO Test Data Output (three-state output, synchronous to TCK) 
This output supplies data from the test logic to an external destination. It 
changes on the falling edge of TCK. It is in the high-impedance state 
except when scanning is in progress. This signal is not available on the 
Am29205 microcontroller. 


TRST _ Test Reset Input (asynchronous input, internal pull-up) 
This input asynchronously resets the test access port. This input places 
the test logic in a state such that no output driver is enabled. The TRST 
input must be asserted in conjunction with the RESET input for correct 
processor initialization, whether or not the JTAG port is used. (See Ap- 
pendix A.) This signal is not available on the Am29205 microcontroller. 








Pin Changes for the Am29205 Microcontroller 


The reduced pin count of the Am29205 microcontroller comes from having a 16-bit 
instruction/data bus, fewer ports on some of the peripherals, and no JTAG interface. The 
following signals supported on the Am29200 microcontroller are not available on the 
Am29205 microcontroller. - 


mw Processor signals: A23—A22, ID15-IDO, WARN, INTR1-INTRO, TRAP1—TRAPO, 
STAT2—-STATO 


ROM interface signals: ROMCS3, BURST, BOOTW 
DRAM interface signals: CAS1-CASO, TR/OE 

PIA signals: PIACSS-PIACS2 

DMA signals: DREQO, DACKO, TDMA, GREG, GACK 
I/O port signals: PIO7-PIOO 

Serial port signals: DSR, DTR 

m JTAG signals: TCK, TDI, TMS, TDO, TRST 


In addition, the Am29200 microcontroller’s WAIT pin is defined as a WAIT/TRIST pin on 
the Am29205 microcontroller. _ 











ACCESS PRIORITY 


Many of the processor interface signals are shared between various types of accesses. 
If more than one access request occurs at the same time, the requests are prioritized as 


follows, in decreasing order of priority: 


1. “Panic mode” DRAM Refresh (see Section 9.3.8) 

2. DMA Channel 0 transfer | 

3. DMA Channel 1 transfer 

4. Memory access request by an external device (see Section 11.5) 
5. Processor DRAM, PIA, or ROM access for data 

6. Processor DRAM or ROM access for an instruction 


External DMA transfers require two accesses: one to read the data from a peripheral or 
the DRAM, and another to write the data to a peripheral or DRAM. The two accesses 
are performed back-to-back, without interruption by another access. 


Some processor accesses to narrow memories (a narrow memory is 8 or 16 bits wide) 
require two or four accesses; for example, reading 32 bits from an 8-bit-wide ROM requires 
four reads. These accesses are also performed back-to-back, without interruption. 
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DRAM refresh cycles are normally overlapped with other, non-DRAM accesses. 
Because normal refresh cycles are performed when there is no conflict with other 
accesses, these cycles are not prioritized in the above list. 


oe) SYSTEM ADDRESS PARTITION | 


All addresses are in the microcontroller’s instruction/data memory address space. The 
address space is partitioned as shown in Table 7-1. : 


Table 7-1 Internal Peripheral Address Ranges 


Address — 
(hexadeximal) 





Maximum Physical Size 


Am29200 Microcontroller Am29205 Microcontroller 

























































































Q0O000000-O3FFFFFF | ROM Banks (all) 64 Mbytes 12 Mbytes 
40000000-43FFFFFF |DRAM Banks (all) 64 Mbytes 32 Mbytes 
50000000-SOFFFFFF |Mapped DRAM Banks (all) 16 Mbytes 16 Mbytes 
60000000-63FFFFFF |VDRAM transfers 64 Mbytes Not Supported 
80000000—800000FC _ | Internal peripherals/controllers — : — 
90000000-90FFFFFF {PIA Region 0 (PIACSO) 16 Mbytes 4 Mbytes 
91000000-91FFFFFF | PIA Region 1 (PIACS1) | 16 Mbytes 4 Mbytes 
92000000-92FFFFFF | PIA Region 2 (PIACS2) 16 Mbytes Not Supported 
93000000-93FFFFFF | PIA Region 3 (PIACS3) 16 Mbytes Not Supported 
94000000-94FFFFFF | PIA Region 4 (PIACS4) 16 Mbytes Not Supported 
95000000-95FFFFFF j|PIA Region 5 (PIACS5) 16 Mbytes Not Supported 





—all others— Reserved 


An access to any unimplemented address or address range has an unpredictable effect 
on processor operation. 


1.4 INTERNAL PERIPHERALS AND CONTROLLERS 


Internal peripheral registers are selected by offsets from address 80000000h. The 
address assignment of the various internal pou and controllers is shown in 
Table 7-2. 


Nearly all registers are read/write and are 32 bits in length. However, a few register bits 
are read only, bits in the Interrupt Control Register are reset-only, and the DMAO 
Address Tail Register and DMAO Count Tail Register are both write-only. It is not 
possible to perform writes on individual bytes or halfwords of any register. Unimplem- 

ented register bits are read as zeros and should be written with zeros to ensure compati- 
bility with future processor versions. 


Three registers have alternates, provided for backward compatibility. The following 
summary shows the preferred and alternate addresses for each of these registers. 


Register | Preferred Address | Alternate Address 
DMAO Address Tail Register 80000070h | 80000036h 
DMAO Count Tail Register 8000003Ch 8000003Ah 
Parallel Port Status Register 800000C8h 800000C 1h 


The alternate DMAO Address Tail Register and the alternate DMAO Count Tail Register 
allow write-only access for compatibility with earlier versions of the Am29200 and 
Am29205 microcontrollers. These two registers are supported for backward 
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compatibility and should not be used for new designs. The DMAO Address Tail Register 
(address 80000070h) and DMAO Count Tail Register (address 8000003Ch) should be 
used instead. 


The alternate Parallel Port Status Register is also provided for compatibility. This register 
should not be used for new designs. The Parallel Port Status Register (address 
800000C8h) should be used instead. 
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Table 7-2 Internal Peripheral Address Assignments 

Address 

Peripheral (hexadecimal) Register 

ROM Controller 80000000 ROM Control Register 
80000004 ROM Configuration Register 

DRAM Controller 80000008 DRAM Control Register 
8000000C DRAM Configuration Register 

DRAM Mapping Unit 80000010 DRAM Mapping Register 0 
80000014 DRAM Mapping Register 1 
80000018 DRAM Mapping Register 2 
8000001C DRAM Mapping Register 3 

Peripheral Interface Adapter 80000020 PIA Control Register 0 
80000024 PIA Control Register 1° 

Interrupt Controller 80000028 Interrupt Contro! Register 

DMA Channel 0 80000030 DMAO Control Register 
80000034 DMAO Address Register 
80000070 DMAO Address Tail Register 
80000038 DMAO Count Register 
8000003C DMAO Count Tail Register 

DMA Channel 1 80000040 DMA1 Control Register 
80000044 DMA1 Address Register 
80000048 DMA1 Count Register 

- Serial Port 80000080 Serial Port Control Register 

80000084 Serial Port Status Register 
80000088 Serial Port Transmit Holding Register 
8000008C Serial Port Receive Buffer Register 
80000090 Baud Rate Divisor Register 

Parallel Port 800000C0 Parallel Port Control! Register 
80000004 Parallel Port Data Register 
800000C8 Parallel Port Status Register 

Programmable I/O Port 800000D0 PIO Control Register 
800000D4 PIO Input Register 
800000D8 ‘PIO Output Register 
800000DC PIO Output Enable Register 

Video Interface 800000E0 Video Control Register 
800000E4 Top Margin Register 
800000E8 Side Margin Register 
800000EC Video Data Holding Register 

Reserved 


—all others— 


Note: ¢ Reserved on the Am29205 microcontroller. 
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Figure 8-1 





ROM CONTROLLER ar 


This chapter describes the operation of the ROM controller. Programmable registers and 
initialization are discussed, along with ROM address mapping, ROM reads and writes, 
burst-mode accesses, and extending ROM cycles. 


OVERVIEW 


The on-chip ROM controller provides a glueless interface to static memory devices 
such as ROMs and EPROMs, as well as alterable devices such as SRAMs, flash 
EPROMs, and memory-mapped peripherals. ROM space on the Am29200 and 
Am29205 microcontrollers is divided into banks, each of which is individually configur- 
able for width and access timing. Programmable registers control the location, size, — 
width, wait-state, and burst capability of each bank. The banks can be arranged to form 
a contiguous memory area. 


The ROM interface on the Am29200 microcontroller accommodates up to four banks of 
ROM. These banks can be 8, 16, or 32 bits wide, with a maximum address space of 16 
Mbytes per bank. 


The Am29205 nilérocontrolie? supports up to three ROM banks; 8- and 16-bit wide 
banks are supported, with a maximum address space of 4 Mbytes per bank. Burst-mode 
ROM access is not supported. Boot ROM width is 16 bits. The signals ROMCSS, 
BURST, and BOOTW are not available on the Am29205 microcontroller. 





PROGRAMMABLE REGISTERS 


ROM Control Register (RMCT, Address 80000000) 


The ROM Control Register (Figure 8-1) controls the access of ROM Banks 0 through 3 
on the Am29200 microcontroller and ROM Banks 0 through 2 on the Am29205 
microcontroller. 


ROM Control Register | 





t seoe 
BSTO LM , res BST1 BST2 BST3 


BWE 
= Reserved on Am29205 microcontroller _ 


Bit 31: Burst-Mode ROM, Bank 0 (BSTO), Am29200 microcontroller—When this bit is 


-1, ROM Bank 0 is accessed using the burst-mode protocol, in which sequential ac- 


cesses are completed at the rate of one access per cycle. When this bit is 0, the 
burst-mode protocol is not used. This bit is reserved on the Am29205 microcontroller. 
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8.2.2 


Bits 30-29: Data Width, Bank 0 (DW0)—This field indicates the width of the ROM in 
Bank 0, as follows: 


DWO ROM Width 
00 32 bits (Reserved on Am29205 microcontroller) 
01 8 bits 
10 : 16 bits | 
11 Reserved 


Bit 28: Large Memory (LM)—This bit controls the size of the ROM banks and the total 
size of the ROM address space. If the LM bit is 0 on either microcontroller, each ROM 
bank is up to 4 Mbytes in size, for placement within a 16 Mbyte address space. 


If the LM bit is 1 on the Am29200 microcontroller, each ROM bank is up to 16 Mbytes in 
size, for placement within a 64-Mbyte address space. If the LM bit is 1 on the Am29205 
microcontroller, each ROM bank is up to 4 Mbytes in size, for placement within a 
64-Mbyte address space. 


Bit 27: Byte Write Enable (BWE)—This bit controls whether or not the CASx signals 
are used as byte strobes during writes to the ROM address space. If BWE is 0, the 
CASx signals are not used during ROM writes (unless there is a hidden refresh at the 
same time). If BWE is 1, the CASx signals are used as byte strobes during a ROM write 
with hidden refresh prohibited during a ROM read or write. 


Bit 26: Reserved 


Bits 25-24: Wait States, Bank 0 (WS0)—This field specifies the number of wait states 
in a ROM access: that is, the number of cycles in addition to one cycle required to 
access the ROM. Zero-wait-state cycles are supported only for non-burst-mode ROM 
reads. Writes to the ROM address space and burst-mode ROMs have a minimum of one 
wait state, even when wait states are programmed at zero. 


* Other bits of this register have a definition similar to BSTO, DWO, and WSO for ROM 


Banks 1 through 3 on the Am29200 microcontroller and ROM Banks 1 through 2 on the 
Am29205 microcontroller. The BSTx bits are not supported on the Am29205 microcon- 
troller. | 


Bits 7-0: Reserved. These bits are reserved on nihe Am29205 microcontroller only. 


ROM Configuration Register (RMCF, Address 80000004) 

The ROM Configuration Register (Figure 8-2) controls the selection of ROM Banks 0 
through 3 on the Am29200 microcontroller and ROM Banks 0 through 2 on the 
Am29205 microcontroller. In most systems, this register should be set by software to 
cause all the banks of ROM to appear as a single, contiguous region of memory. 


Bits 31-27: Address Select, Bank 0 (ASELO)—On a load, store, or instruction access, 
this field is compared against bits of the access address, with the comparisons possibly 
masked by the AMASKO field. The unmasked bits of the ASELO field must match the 
corresponding bits of the address for ROM Bank 0 to be accessed. 


Bits 26—24: Address Mask, Bank 0 (AMASK0)—This field masks the comparison of 
the ASELO field with bits of the address on an access, to permit various sizes of | 
memories and memory chips in ROM Bank 0 (“ad(x:y)” represents a field of address bits 
x through y, inclusive). 
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ROM Configuration Register 





3 23 15 
ASELO AMASKO ASEL1 AMASK1 ASEL2 AMASK2 
/ 


= Reserved on Am29205 microcontroller 








AMASKO Address Comparison Address Comparison 
Value (LM=0) (LM=1) | 
000 ASELO 4:0) to Bn Ese ASELOIA :0) to aa(25:21 
-001 ASELO(4:1) to aa(23:20 ASELO(4:1) to aa(25:22 
011 ASELO(4:2) to ad(23:21 ASELO(4:2) to ac(25:23 tReeerved on Am29205) 
Reserved on Am29205) 


111 ASELO(4:3) to aa(23:22) ASELO(4:3) to aa(25:24 


Only the AMASKO values shown in the above table are valid. The AMASKO field permits 
various sizes of memories and memory chips in ROM Bank 0 that are independent of 
the sizes in the other banks. 


Other bits of this register have a definition similar to ASELO and AMASKO for ROM 
Banks 1 through 3 on the Am29200 microcontroller and ROM Banks 1 through 2 on the 
Am29205 microcontroller. 


Bits 7-0: Reserved. These bits are reserved on the Am29205 microcontroller only. 
Although ROM Bank 3 is not supported on the Am29205 microcontroller, AMASK3 and 
ASELS3 still exist in the ROM Configuration Register. ASEL3 must be programmed to a 
value that does not overlap with addresses specified for ROM Banks 2 through 0. 


Initialization 


ROM Bank 0 is used as the oe ROM containing the initialization code for the processor 
and peripherals. _ 


On the Am29200 microcontroller, the width of Bank 0 is set by the BOOTW signal, which 
is sampled during and after a processor reset. If BOOTW is High before and after reset 
(tied High), the boot ROM is 32 bits wide. If BOOTW is Low before and after reset (tied 
Low), the boot ROM is 16 bits wide. If BOOTW is Low before reset and High after reset 
(tied to RESET), the boot ROM is 8 bits wide. The BOOTW signal is used to set the 
DWO field before the boot ROM is accessed. 


On the Am29205 microcontroller, the boot ROM width in Bank 0 is 16 bits during 
processor reset. No 8-bit booting is possible since the BOOTW signal is not supported 
on the Am29205 microcontroller. | 


The boot ROM defaults to a non-burst-mode ROM with three wait states until the ROM 
Control Register and ROM Configuration Register are set with the correct configuration. 
The LM bit is reset to 0. The ASELO and AMASKO fields are both set to zero by a 
processor reset. 
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To prevent bank conflicts during initialization, the ASEL and AMASK fields for ROM 
banks 1 through 3 are set to all 1s. The configuration of ROM banks 1 through 3, if 
present, must be set by software before the respective bank is accessed. 


ROM ACCESSES 


ROM Address Mapping 


To map logical memory banks to physical addresses, each ROM bank uses two fields to 
determine the location of the bank in physical memory: AMASK and ASEL. AMASK 
selects the number of address bits decoded and thus the size of a given bank. ASEL 
contains the address bit values compared against the address and thus the location of a. 
given bank. The LM bit controls the maximum size of the banks and the total size of the 


ROM address space as shown. 


AMASK Value Bank Size (LM=0) Bank Size (LM=1) 
000 512 Kbytes 2 Mbytes 
001 1 Mbyte - 4Mbytes 
011 2 Mbytes 8 Mbytes* 
111 4 Mbytes , 16 Mbytes* 


* Am29200 microcontroller only 


The ASEL and AMASK fields allow the three or four ROM banks to appear as a 
contiguous region of ROM, with the restriction that a bank of a certain size must fit on 
the natural address boundary for that size. For example, a 2-Mbyte ROM must be 
placed on a 2-Mbyte address boundary. For this reason, ROM banks must appear in the 
address space in order of decreasing bank size if the banks are to be contiguous. Note 
that to achieve a contiguous memory, the various ROM banks need not appear in 
sequence in the address space. For example, on the Am29200 microcontroller, ROM 
Bank 3 may appear in an address range below the address range for ROM Bank 1 or 2. 
The only restriction in the placement of ROM banks is that ROM Bank 0 is used for the 
initial instruction fetches after a processor reset, starting at address 00000000, hexadeci- 
mal. Setting AMASK to 0 and ASEL to 1Fh reduces the probability of empty banks being 
inadvertently decoded. This configures the bank as small and as high in memory as 
possible. 


Simple ROM Accesses 

Figure 8-3 shows the timing of a simple ROM read cycle. The number of cycles is 
controlled by the WSx field in the ROM Control! Register (“x” represents one of ROM 
Banks 0 through 3). The WSx field specifies the number of wait states: that is, the 
number of cycles beyond one cycle required to access the ROM. 


Figure 8-4 shows the timing of a zero-wait-state ROM read (WSx = 00). In this case, the 
ROMOE signal is asserted at the midpoint of the cycle rather than at the beginning of 
the second cycle (since there is no second cycle). 


Narrow ROM Accesses 


A narrow ROM is one that is less than 32 bits wide. The Am29200 and Am29205 
microcontrollers support 8- and 16-bit-wide ROMs in any bank, as determined by the 
DWx< field in the ROM Control Register. 
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ROMCSx 


ROMOE. 
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8.3.3.1 


Simple ROM Read Cycle 








number of cycles determined by WSx+1 . 
WSx:s3 illustrated ' 


An 8-bit-wide ROM must be attached to ID31-ID24. A 16-bit-wide ROM must be 
attached to |ID31—ID16 and ignores AO. A 32-bit ROM is attached to 1ID31-IDO and 
ignores A1—AO. A narrow ROM can respond to any read access, but the ROM must be 
at least 16 bits wide to respond to writes. Writes to 8-bit memories are not supported and 
may provide unreliable results. 


8-Bit Narrow Accesses 


If the processor expects a half-word or a word on a read (that is, if the access is nota 
byte read), and a narrow ROM is 8 bits wide, the microcontroller generates one (for a 
half-word) or three (for a word) requests immediately following the first access. No 
other intervening accesses are performed. The address for each subsequent access is 
the same as the address for the first access, except that A1—-A0 are incremented by 
one for each access. A burst-mode access may be performed for the subsequent bytes 
if the ROM permits such an access and if the ROM Control! Register is programmed to 
enable burst. | | : , 


The microcontroller assembles the final word or half-word by placing the first received 
byte in the high-order byte position of the word or half-word. The second received byte is 
placed in the next-lower-order byte position and so on until the entire word or half-word 
is assembled. | 


If the read access is a byte access, the processor performs only one access. 


If software generates an unaligned half-word or word read, the narrow ROM does not 
permit the implementation of the unaligned read. The address sequence generated to 
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Figure 8-4 Simple ROM Read Cycle—Zero Wait States 
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RSWE 
ID31-IDO 
assemble the half-word or word wraps within the half-word or word. A trap on unaligned 
access is available and may be used to detect and correct such accesses. | 
8.3.3.2 16-Bit Narrow Accesses 


If the processor expects a word on a read, and a narrow ROM is 16 bits wide, the 
microcontroller generates one more request immediately following the first access. No 
other intervening accesses are performed. The address for the second access is the 
same as the address for the first access, except that A1-A0 are incremented by two for 
the second access. A burst-mode access may be performed for the second 16 bits if the 
ROM permits such an access. 


The microcontroller assembles the final word by placing the first received half-word in 
the high-order half-word position of the word, and the second received half-word in the 
low-order half-word position. | 


If the read access is a byte or half-word access, the processor performs only one 
access. | 


If software generates an unaligned word read, the narrow ROM does not permit the 
implementation of the unaligned read. The address sequence generated to assemble 
the word wraps within the word. A trap on unaligned access is available and may be 
used to detect and correct such accesses. 


8-6 ROM Controller 


8.3.4 
8.3.4.1 


8.3.4.2 


Figure 8-5 


MEMCLK 


A23—A0 


R/AW 


ROMCSx 


ROMOE 


BURST 





RSWE 


ID31~IDO 


AMD a | 
Writes to the ROM Space 


Simple Writes 

Figure 8-5 shows the timing of a simple write to the ROM address space. This cycle is 
provided for alterable memories in the ROM space, such as SRAMs or Flash EPROMs. 
Zero-wait-state cycles are not supported for writes. 


Because of processor limitations; the ROM must be at least 16 bits wide to support 
writes (see Section 8.3.3). If 32-bit data is written into a 16-bit-wide ROM, the processor 
performs two back-to-back uninterrupted accesses. On the first cycle of the second — 
write, the processor drives the data bus with the second 16 bits (that is, in the same 
cycle in which ROMCSx and A23-A0 are asserted). 


Byte Writes | | 


If the BWE bit is set in the ROM Control Register, the processor uses the CASx signals 
as individual byte strobes, to allow byte and half-word writes to the ROM address space. 
Note that this reuse of the CASx signals causes CAS-only cycles to the memories in the 
DRAM banks (if present) during ROM writes and causes spurious write enables to 
non-selected memories in the ROM banks during DRAM accesses. These normally do 
not cause invalid operation. Furthermore, hidden refresh is disabled during ROM reads 
or writes if the BWE bit is set, to prevent invalid interference between simultaneous ROM 
and DRAM cycles. Thus, one slight disadvantage of using ROM byte writes is that there 


are fewer hidden refresh cycles and hence slightly degraded system performance. 


Simple Write to ROM Bank | 
(for alterable memories in the ROM address space) 





number of cycles determined by WSx+1 
ie WSxs3 illustrated ro 
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The GAS3-CASO signals on the Am29200 microcontroller are used to write individual 
bytes for a 32-bit-wide ROM bank as follows: 


Data width A1-A0. CAS3-CASO (on write) 
Bbits 00 | O111 
8 bits 01 1011 
8 bits 10 1101 
8 bits 11 1110 
16 bits Ox 0011 
16 bits 1x . 1100 


32 bits . XX 0000 


The CAS3-CASO signals are used to write individual bytes for a 16-bit-wide bank (that 
is, a narrow bank) as follows: 


Data width A1-A0 CAS3-CAS0 (on write) 
8 bits 00 0114 
8 bits 01 1011 
8 bits 10 . O11 
8 bits 11 1011 
16 bits Ox | 0011 
16 bits 1x 0011 
—all other writes (two cycles)— 0011 


Byte writes are not supported for 8-bit-wide narrow banks. 


Figure 8-6 shows the timing of a write to the ROM address space. The CAS3—CASO 
signals have exactly the same timing as RSWE. 





Burst-Mode ROM Accesses (Am29200 Microcontroller) 


Figure 8-7 shows the timing of a burst-mode ROM access, for direct connection to 
burst-mode devices. Burst-mode accesses have a minimum of one wait state for the 
initial access, even when wait states are programmed as zero; sequential access after 
that are single cycle. Burst-mode writes are not.supported. Burst-mode ROM accesses 
are not supported on the Am29205 microcontroller. 


Use of WAIT to Extend ROM Cycles 


If the WAIT signal is asserted two cycles before the end of a ROM access (that is, two 
cycles before the cycle in which ROMCSx would normally be deasserted), the processor 
extends the ROM access until WAIT is deasserted. This permits the system to extend 
the ROM access indefinitely. The access ends on the cycle after WAIT is deasserted, 
both for reads (Figure 8-8) and for writes (Figure 8-9). Note that the wait state counter 
continues to count while WAIT is active, so that the cycle is controlled by either the wait 
State counter or WAIT, depending on which has the longer duration. Note that WAIT will 
not be recognized by any bank programmed for zero wait states. © 











The WAIT signal on the Am29200 microcontroller can also be used to extend individual 
accesses in a sequence of burst-mode accesses. For each access, the processor does 
not consider the data to be valid until the cycle after WAIT is High (Figure 8-7). 
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Figure 8-6 Byte Write to ROM Bank (using CAS3-CASO as byte strobes) 
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Figure 8-7 Burst-Mode ROM Read (Am29200 Microcontroller) 
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Figure 8-8 Extending a ROM Read Cycle with WAIT 
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| 9 DRAM CONTROLLER 





This chapter describes the DRAM interface on the Am29200 and Am29205 microcon- 
trollers. The programmable registers are presented, followed by a discussion of DRAM 
accesses, address mapping, and address multiplexing. Mapped DRAM accesses, 
page-mode timing, DRAM refresh, and video DRAM transfers are also described. 


9.1 OVERVIEW 


The Am29200 and Am29205 microcontrollers directly support DRAM devices without 
any additional components, providing RAS and CAS generation, address multiplexing, 
and refresh generation. The on-chip DRAM controller utilizes page-mode accesses and 
CAS-before-RAS refresh to extract maximum performance from DRAM devices. 


The DRAM interface accommodates up to four banks of DRAM that appear as a 
contiguous memory. Each bank on the Am29200 microcontroller is individually configur- 
able in width; the Am29205 microcontroller supports only 16-bit wide DRAM banks . In 
addition, four 64-Kbyte regions of the DRAM can be mapped into a 16-Mbyte virtual 
address space. The DRAM controller provides a fixed access time of three cycles plus 
one cycle of RAS precharge after each access. Two cycle page- -mode accesses are 
supported. 


To support a lower pin count, several signals used by the Am29200 microcontroller for _ 
DRAM interfacing are not available on the Am29205 microcontroller. Because the external 
data bus is only 16-bits wide, there need be only two CAS signals (one CAS per byte), 
labeled CAS3 and CAS2. The internal circuitry of the Am29205 microcontroller automati- 
cally concatenates the two 16-bit accesses, using big-endian structure for a full 32-bit 
word. The TR/OE signal for normal DRAM output enable and video DRAM transfer is not 
available on the Am29205 microcontroller. Any DRAM with an OE line should be tied to 
CAS for the bank, or tied to ground (asserted) as internal DRAM logic gates OE with CS. 
Video DRAM transfers are not supported on the Am29205 microcontroller. 


9.2 PROGRAMMABLE REGISTERS 
9.2.1 DRAM Control Register (DRCT, Address 80000008) 
The DRAM Control Register (Figure 9-1) controls the access to and refresh of DRAM 


Banks 0 through 3. 


Figure 9-1 DRAM Control Register 
0 


PGo! res Pat pG2! 
DWO LM DW1 DW2 sis 
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Bit 31: Page-Mode DRAM, Bank 0 (PG0)—When this bit is 1, burst-mode accesses to 


~ DRAM Bank 0 are performed using page-mode accesses for all but the first access. 


When this bit is 0, page-mode accesses are not performed. 


Bit 30: Data Width, Bank 0 (DW0)—This field indicates the width of the DRAM in Bank 
0, as follows: 


DW Value DRAM Width 
0 32 bits (Reserved on Am29205 microcontroller) 
1 16 bits 


Since the Am29205 microcontroller euppeie only 16-bit DRAM, all DWx bits should be 
Set to 1. 


Bit 29: Reserved 


Bit 28: Large Memory (LM)—This bit controls the size of the DRAM banks and the total 
size of the DRAM address space. If the LM bit is 0 on either microcontroller, each DRAM 
bank is up to 4 Mbytes in size, for placement within a 16 Mbyte address space. — 


If the LM bit is 1 on the Am29200 microcontroller, each DRAM bank is up to 16 Mbytes 
in size, for placement within a 64-Mbyte address space. Ifthe LM bitis 1 onthe — 
Am29205 microcontroller, each DRAM bank is up to 8 Mbytes i in size, for placement 
within a 64-Mbyte address space. 


PG1, DW1, and soon perform functions similar to PGO and DW0 for DRAM ranks 1 
through 3. 


Bit 15: Static-Column DRAM (SC)—Whern this bit is 1, page-mode accesses to the 
DRAM are performed using static-column accesses. Static column accesses differ from 
page-mode cycles only in that CAS3—CASO are held Low throughout a read access. The 
timing of the access is not affected, and write accesses are not affected. When this bit is 
0, normal page-mode accesses are performed, if enabled. 


Bits 14-9: Reserved 


Bits 8-0: Refresh Rate (REFRATE)—This field indicates the number of MEMCLK 
cycles between DRAM refresh intervals. A DRAM refresh interval is the time required to 
refresh all four DRAM banks. “CAS before RAS” cycles are performed, overlapped in the - 
background with other non-DRAM accesses when possible. If one or more banks have 
not been refreshed in the background when the REFRATE interval expires, the proces- 
sor forces a panic mode refresh of the unrefreshed banks. 


A zero in the REFRATE field disables refresh. Upon reset, this field is initialized to the 
value 1FFh. 


DRAM Configuration Register (DRCF, Address 8000000C) 


The DRAM Configuration Register (Figure 9-2) controls the selection of DRAM Banks 0 
through 3. In most systems, this register should be set by software to cause the four 
banks of DRAM to appear as a single, contiguous region of memory. | 
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DRAM Configuration Register 


31 15 7 0 
ASELO |AMASKO| - ASEL1 |AMASK1]/ ASEL2 |AMASK2} ASEL3 | AMASK3 


Bits 31-27: Address Select, Bank 0 (ASELO)—On a load, store, or instruction access, 
this field is compared against bits of the access address, with the comparisons possibly 
masked by the AMASKO field. The unmasked bits of the ASELO field must match the 
corresponding bits of the address for DRAM bank 0 to be accessed. | 


Bits 26-24: Address Mask, Bank 0 (AMASKO)—This field masks the comparison of 
the ASELO field with bits of the address on an access, to permit various sizes of 
memories and memory chips in DRAM Bank 0 (“aa(x:y)” represents a field of address 
bits x through y, inclusive). — 








AMASKO Address Comparison Address Comparison 
Value (LM=0) - (LM=1) 
000 ASELO(4:0) to ad(23:19) ASELO(4:0) to aa(25:21 
001 ASELOA: :1) to eaes Ot ASELO(4:1) to aioe. 122 
011 ASELO(4:2) to aa(23:21 ASELO(4:2) to aa(25:23 
111 ASELO(4:3) to ad(23:22) ASELO(4:3) to aa(25:24) (Reserved on Am29205) 


Only the AMASKO values shown in the above table are valid. 
Other bits of this register have a definition similar to ASELO and AMASKO for DRAM 
Banks 1 through 3. 


DRAM Mapping seiaaie 0 (DRMO, Address 8000001 0) 


This register (Figure 9- ie specifies one of four pestle mappings of a mapped DRAM 
access. 


DRAM Mapping Register 0 


31 23 15 : 7 0 
te VIRTBASE res PHYBASE 
‘ . 
VALID 


Bit 31: Valid Mapping (VALID)—This bit, when 1, indicates that the mapping specified — 
by the VIRTBASE and PHYBASE fields is valid. | 


Bits 30—24: Reserved 


Bits 23-16: Virtual Base Address (VIRTBASE)—This field specifies the virtual base 
address of the mapped region. On a mapped DRAM access, it is compared against bits 
23-16 of the address generated by the load or store instruction. The comparison must 
match for the mapping to be performed. 
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Bits 15-8: Reserved 


Bits 7-0: Physical Base Address (PHYBASE)—This field specifies the physical base 
address of the mapped region. On a mapped DRAM access, if the comparison of the 
virtual base address yields a match and the VALID bit is 1, the PHYBASE field replaces 
bits 23-16 of the address. 


DRAM Mapping Register 1 (DRM1, Address 80000014) 


This register is identical in layout and definition to the DRAM Mapping Register 0. 
It Spence the second of the four possible mappings. 


DRAM Mapping Register 2 (DRM2, Address 8000001 8) 


This register is identical in layout and definition to the DRAM Mapping Register 0. 
It specifies the third of the four possible mappings. 


DRAM Mapping Register 3 (DRM3, Address 8000001C) 


This register is identical in layout and definition to the DRAM Mapping Register 0. 
It specifies the the fourth of the four possible mappings. 


Initialization 

The configuration of DRAM banks, if present, must be set by software before normal 
DRAM accesses are performed (the DRAM may be accessed using default parameters 
that are set by software to determine the configuration of the DRAM). The DRAM 
Mapping registers are not initialized by a processor reset, and must be set by 

software before a mapped DRAM access occurs. The REFRATE field is initialized 

on reset to the value 1-Fh. DRAM power-up requirements must be guaranteed by 
software. 


DRAM ACCESSES 


DRAM Address Mapping 


To map logical memory banks to physical addresses, each DRAM bank uses two fields 
to determine the location of the bank in physical memory: AMASK and ASEL. AMASK 
selects the number of address bits decoded and thus the size of a given bank. ASEL 
contains the address bit values compared against the address and thus the location of a 
given bank. The LM bit controls the maximum size of the banks and the total size of the 
DRAM address space as shown. 


AMASK Value Bank Size (LM=0) Bank Size (LM=1) 
000 512 Kbytes — 2 Mbytes 
001 1 Mbyte 4 Mbytes 
011 2 Mbytes 8 Mbytes” 


111 _ 4Mbytes 16 Mbytes” 


* Am29200 microcontroller only 


The ASEL and AMASK fields allow the four DRAM banks to appear as a contiguous 
region of DRAM, with the restriction that a bank of a certain size must fit on the natural 
address boundary for that size. For example, a 2-Mbyte DRAM must be placed ona 
2-Mbyte address boundary. For this reason, DRAM banks must appear in the address 
space in order of decreasing bank size. Note that to achieve a contiguous memory, the 
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various DRAM banks need not appear in sequence in the address space. For example, 
DRAM Bank 3 may appear in an address range below the address range for DRAM 
Bank 1 or 2. This provides flexibility in meeting the restriction that DRAM banks appear 
in the address space in order of decreasing size. Setting AMASK to 0 and ASEL to 1Fh 
reduces the probability of empty banks being inadvertently decoded. This configures the 
bank as small and as high in memory as possible. 


Address Multiplexing | 


The address multiplexing for the DRAMs is performed directly by the processor on the 
A14—-A1 pins, and no external multiplexing is required. As shown in Table 9-1 and 

Table 9-2, only the odd physical address pins from A9 and above (AQ, A11, and A13) are 
used for 16-bit interfaces, while only even physical address pins above AQ (A10, A12, 
and A14) are used for 32-bit memories. Address bit AO is not represented, since the 
Am29200 microcontroller supports only 16- and 32-bit DRAM widths. Address multiplex- 
ing for 16- and 32-bit DRAM memories is performed as shown in Table 9-1 and 

Table 9-2 (“ax” represents address bit x). 


Address Multiplexing for 16-bit DRAM Memory 


Address Pin RAS Asserted . 





Bank Depth 
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Note: * indicates signals not applicable to the bus width. 
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Table 9-2 Address Multiplexing for 32-bit DRAM Memory (Am29200 Microcontroller) 


, Bank Depth Bank Depth 
Address Pin RAS Asserted CAS Asserted (LM=0) (ea) (LM=1) (ea) 


ee +My 16 Mbyte 
a20 . 
ai9 
ai7 
ai4 
ai3 
ai 





a23 

512 Kbyte 1 Mbyte 
oe ee a 
Lae ee] 


stay Moy 





=e 
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a a 


Note: * indicates signals not applicable to the bus width. 


Table 9-3 shows how this multiplexing of addresses supports various configurations of 
memory densities and memory widths, assuming the individual DRAMs are 4 bits wide. 
The addresses shown in Table 9-3 are the address bits for an access. Table 9-4 shows 
how the various memories should be connected to the processor’s address pins to 
realize this address multiplexing, again assuming the individual DRAMs are 4 bits wide. 


Sequential accesses can use page-mode accesses, even though not all CAS address bits 
are contiguous address bits, because the processor does not generate a page-mode 


Table 9-3 DRAM Address Multiplexing (by-4 DRAMs) 
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DRAM Address Connections to Microcontroller (by-4 DRAMs) 


DRAM multiplexed address bits 





access across a 1-Kbyte address boundary. Thus, the processor will not change any 
address bits other than a(9:1) during a page-mode access. 


32-Bit DRAM Width (Am29200 Microcontroller) 


For a data access, the width of each DRAM bank on the Am29200 microcontroller can 
be programmed to be either 32 or 16 bits by the DRAM Control Register. If the DRAM is 
32 bits wide, 1D31—IDO are used to transfer data to and from the processor, and the 
processor performs one access to read or write a byte, half-word, or word. The © 
CAS3-CASO signals are asserted as follows (the value “O” is Low, “1” is High, and “x” is 
a don’t care): 


Data Width A1-A0 CAS3—CAS0 (on write) 


8 bits 00 0111 
8 bits 01 1011 
8 bits 10 1101 
8 bits 11 1110 
16 bits Ox 0011 
16 bits 1x 1100 
32 bits 00 (one cycle) 0000 


16-Bit DRAM Width 


If the DRAM is 16 bits wide on either the Am29200 or Am29205 microcontroller, only 
ID31-ID16 are used to transfer data to and from the processor and the processor 


_ performs two accesses to read or write a full word. 


To read a 32-bit word from a 16-bit DRAM bank, the processor first reads the high-order 
16 bits of the word, then generates a second access to read the low-order 16 bits of the 
word. The address is incremented by two for the second access. To read an 8-bit byte or 
16-bit half-word from a 16-bit DRAM, the processor performs only a single access. 
Alignment and sign extension are performed as usual, except the required byte or 
half-word is received on ID31-ID16. Figure 9-4 shows the location of bytes and half- 
words from a 16-bit DRAM bank. In Figure 9-4, bytes and half-words are numbered as 
they are numbered in a word. 


To write a 32-bit word into a 16-bit DRAM bank, the processor 1 first writes the high- 

order 16 bits of the word, then generates a second access to write the low-order 16 bits 
of the word. The address is incremented by two for the second access, and the low 
order bits of the word appear on ID31-ID16. To write an 8-bit byte or 16-bit half-word on 
a 16-bit bus, the processor performs only a single access. For a byte write, the appropri- 
ate byte is replicated on both ID31-ID24 and !D23-ID16. For a half-word write, the 
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Figure 9-4 Location of Bytes and Half-Words on a 16-Bit Bus 
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Byte 0 Byte 1 
ID31 15 0 


appropriate half-word appears on ID31-ID16. The CAS3—CASO signals are asserted as 
follows (the value “O” is Low, “1” is High, and “x” is a don’t care): 


Data width A1-A0_ CAS3-CAS0 (on write) 
8 bits 00 | 0111 
8 bits 01 1011 
8 bits 10 0111 
8 bits 11 1011 

16 bits Ox 0011 

16 bits 1x 0011 

—all other writes (two cycles)— 0011 

9.3.5 Mapped DRAM Accesses 


Processor DRAM accesses in the 16-Mbyte address range 50000000h-SOFFFFFFh are 
mapped to one of four 64-Kbyte regions of the DRAM. This provides a virtual memory 
region supporting functions such as image compression and decompression that yield 
lower overall memory requirements and thus lower system cost. Only processor DRAM 
accesses can be mapped. DRAM accesses by a DMA channel cannot be mapped. 


DRAM Mapping Registers 0 through 3 each specify a DRAM mapping. Before an access 
to a DRAM location having an address in the range 50000000h—50FFFFFFh, bits 23-16 
of the address are compared to the VIRTBASE fields in each of the DRAM Mapping 
registers. If the address bits match the VIRTBASE field in one of the registers, and the 
associated VALID bit is 1, then the PHYBASE field replaces bits 23-16 of the address 
before the access is performed. If more than one valid comparison occurs, the mapping 
_ specified by DRAM Mapping Register 0 has the highest priority, and the mapping 

specified by DRAM Mapping Register 3 has the lowest priority. If no valid comparison is 
detected, the processor’s User- or Supervisor-mode Instruction or Data Mapping Miss 

“ occurs, depending on the program mode and type of access. . | 


9.3.6 Normal Access Timing 


Figure 9-5 shows the timing for a normal DRAM read cycle. Figure 9-6 shows the timing 
for a normal DRAM write cycle. DRAM cycles are fixed at four cycles including precharge 
and cannot be extended with WAIT. An additional cycle is taken after the data is read or 
written to permit time for RAS precharge. The rising edge of RAS occurs on the third 
rising edge of MEMCLK after the beginning of the cycle. 
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Figure 9-5 DRAM Read Cycle 
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Figure 9-6 DRAM Write Cycle 
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9.3.7 Page-Mode Access Timing 


Page-mode accesses can be enabled for each bank to reduce the average access time 
for a sequence of accesses. If enabled, page-mode accesses are performed for 
instruction accesses and for the LOADM and STOREM instructions. Page-mode 
accesses permit an access time of two cycles for all but the first access. When the 
DRAM bank is 16 bits wide, two accesses are required to obtain a 32-bit word. Page- 
mode accesses are performed to access the second 16 bits in this case if page-mode 
accesses are enabled. 


Figure 9-7 shows the timing for a page-mode DRAM read cycle. Figure 9-8 shows the 
timing for a page-mode DRAM write cycle. Static-column accesses are performed if 
SC=1 in the DRAM Control Register. Static-column accesses differ from page-mode 
accesses only in that CASx remain Low throughout the access. 


9.3.8 DRAM Refresh 


“CAS before RAS” refresh cycles are performed periodically, as determined by the 
REFRATE field of the DRAM Control Register. The REFRATE field specifies the number 
of MEMCLK cycles in a refresh interval; a zero in this field disables refresh. The 
microcontroller ensures that one row of each DRAM bank is refreshed in every interval. 
Each bank is refreshed separately to distribute the demand placed on the DRAM power 
supplies by the individual banks. | 


Figure 9-9 shows the timing of a refresh cycle. Because refresh cycles use only the 
RASx and CASx signals, the processor attempts to perform refresh in the background, 
refreshing each bank in the cycles that the DRAM is not being used, possibly overlapped 


Figure 9-7 DRAM Page-Mode Read Cycle 
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Figure 9-8 DRAM Page-Mode Write Cycle 
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Figure 9-9 DRAM Refresh Cycle 
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with ROM and PIA accesses. Background refresh incurs very little overhead. The 
average penalty of background refresh is about 2 cycles per refresh interval. This 
penalty arises because the processor sometimes attempts to access the DRAM after a 
refresh cycle has been started. If one or more banks has not been refreshed by the end 
of a refresh interval, the DRAM controller performs “panic mode” refresh cycles to 
refresh the remaining banks. Panic mode refresh cycles take priority over all other 
processor accesses. 


9.3.9 Video DRAM Interface (Am29200 Microcontroller) 


A video DRAM (VDRAM) transfer cycle is performed during accesses in the range 
60000000h- 63FFFFFFh. These cycles permit the transfer of data toa VDRAM shift 
register in graphics applications. 


For VDRAM transfer cycles with 16- bit memories, the DRAM bank’s page-mode bit 
(PGx) in the DRAM Control Register must be turned off. 


Figure 9-10 shows the timing of a VDRAM transfer cycle. Note that the ID bus is not 
forced to high impedance. This cycle differs from a normal DRAM cycle because the 
signal TR/OE is asserted with different timing. 


Figure 9-10 VDRAM Transfer Cycle (Am29200 Microcontroller) 


ID3i1-IDO 





9-12 ; = DRAM Controller 


er ra 


4 O PERIPHERAL INTERFACE ADAPTER 





This chapter describes the peripheral interface (PIA) adapter on the Am29200 and 
Am29205 microcontrollers. Information is provided on the programmable registers, 
initialization, and PIA accesses, including timing. | 


10.1 OVERVIEW 


PIA space on the microcontroller is divided into regions, each of which can be directly 
attached to an off-chip peripheral device. The microcontroller’s dedicated PIA chip select 
signals will assert a peripheral device’s chip select input pin when the associated PIA 
region on the microcontroller is read or written. 


With six PIA chip select signals, PPACS5—-PIACSO, the Am29200 microcontroller permits 
direct attachment of up to six external peripheral devices, each with its own 24-bit 
address space, for a maximum size of 16 Mbytes per PIA region. 


With two PIA chip select signals. and a smaller address bus, the Am29205 microcontrol- 
ler supports up to two peripheral devices, each with its own 22-bit memory space, for a 
maximum size of 4 Mbytes per PIA region. The 16-bit ID bus of the Am29205 microcon- 
troller limits PIA support to 8- or 16-bit peripherals. The PIACS5—-PIACS2 signals are not 
supported on the Am29205 microcontroller. — 








10.2. PROGRAMMABLE REGISTERS 


10.2.1 PIA Control Register 0/1 (PICTO/1, Address 80000020/24) 


The PIA Control Registers (Figure 10-1 and Figure 10-2) control the access to PIA 
Regions 0 through 5 on the Am29200 microcontroller. The PIA Control Register 1 is not 
available on the Am29205 microcontroller, since that product does not support PIA 
Regions 2, 3, 4, or 5. | 


Figure 10-1 PIA Control Register 0 (PICTO, Address 80000020) 
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Figure 10-2 PIA Control Register 1 (PICT1, Address 80000024) (Am29200 Microcontroller) 
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Bit 31: Input/Output Extend, Region 0 (IOEXT0)—'If this bit is one, the end of a PIA 
access is extended by one cycle after PIAOE is deasserted or by two cycles after 
PIAWE is deasserted. This provides one additional cycle of output disable time or data 
hold time for reads and writes, respectively. 


Bits 30—29: Reserved 


Bits 28—24: Input/Output Wait States, Region 0 (IOWAIT0)—This field specifies the 
number of wait states taken by an access to PIA Region 0. An I/O read cycle takes at 
least three cycles (two wait states), and an I/O write cycle takes at least four cycles 
(three wait states). If the IOWAITO field specifies an insufficient number of wait states for 
an access (for example, IOWAITO = 00010b for a write), the processor takes the 
required minimum number of wait states instead of the specified number. 


Other bits perform similar functions to IOEXTO and IOWAITO for PIA Regions 1 through 5. 








Bits 15—0: Reserved. These bits are reserved on the Am29205 microcontroller and 
should be written with Os to ensure compatibility. 


Initialization 


The configuration of PIA regions, if present, must be set by software before PIA 
accesses are performed. Peripherals may be accessed using default parameters set by 
software to determine the presence and/or configuration of the peripherals. 


PIA ACCESSES 


PIA accesses are performed as a result of load and store instructions having an address 
within the range of the specific PIA regions supported by the microcontroller. The 
Am29200 microcontroller supports PIA Regions 5 through 0; the Am29205 microcontrol- 
ler supports PIA Regions 1 through O. The PIA region number determines which PIACSx 
signal is asserted during the access. PIACSO is asserted for an access to PIA Region 0, 
and so on. | 








The Am29200 microcontroller supports PIA accesses within the address ranges of PIA 
Region 0 (addresses 90000000h—90FFFFFFh) through PIA Region 5 (addresses 
95000000h-95FFFFFFh). The data width of the load or store (selected by the OPT bits 
of the instruction) determines the width of the access. An 8-bit device must be attached 
to ID7-IDO on the Am29200 microcontroller and a 16-bit device must be attached to 
1D15—IDO. LOADM and STOREM instructions (possible only for 32-bit accesses) are 
performed as a series of simple loads and stores. 


The Am29205 microcontroller supports PIA accesses within the address ranges of PIA 
Region 0 (addresses 90000000h-90FFFFFFh) through PIA Region 1 (addresses 
91000000h-91FFFFFFh). Since the instruction/data bus on the Am29205 microcontrol- 
ler is 16 bits wide, only 8- or 16-bit peripherals are supported. They are attached to 
ID23-ID16 and ID31-ID16, respectively. Only byte or half-word accesses (specified by 
the OPT bits in the load and store instructions) are allowed on these peripherals. 
Performing a 32-bit data access on these peripherals may result in unpredictable data. 


When a byte access is made to the PIA region on either microcontroller, the two least 
significant bits of the address must be 11. When a half-word access is made, the two 
least significant bits must be 10. Instruction fetching from a PIA region is not supported. 


Normal Access Timing 


Figure 10-3 shows the timing of a PIA read cycle. The address is driven in the first cycle, 
the PIACSx signal is asserted in the second cycle to allow for address setup, and the 
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PIAOE signal is asserted in the third cycle to allow for chip select setup. The data must 
be valid after the number of cycles specified by IOWAITx+1. After sampling the data, the 
microcontroller deasserts PIACSx and PIAOE. The interface operates such that the 
processor allows at least one cycle before it drives the instruction/data bus (ID31—IDO on 
the Am29200 microcontroller and ID 31-16 on the Am29205 microcontroller) for a new 
access (though a new address may be driven on the address bus immediately), 
providing one cycle for the peripheral to disable its drivers. If this cycle is insufficient, » 
setting the IOEXTx bit for the region causes the processor to insert an additional cycle 
after the read before starting a new access. 








Figure 10-4 shows timing of a PIA write cycle. The PIAOE signal is not asserted. 
Instead, the processor drives data in the second cycle and asserts the PIAWE signal in 
the third cycle to allow for address, data, and chip select setup. The PIAWE signal is 
deasserted one cycle before the final cycle to provide data hold time for the write. If one 
cycle of hold time is insufficient, setting the IOEXTx bit for the region causes the 
processor to insert an additional cycle of data hold time. 








Use of WAIT to Extend I/O Cycles 


The WAIT signal is used to extend the number of wait states beyond the number 
determined by the IOWAITx field. WAIT can be asserted during a read at any time up 








~ until two cycles before PIAOE is deasserted, and can be asserted during a write at any 








time up until two cycles before PIAWE is deasserted. In response to WAIT, the proces- 
sor extends the access until WAIT is deasserted. If WAIT is asserted within the appropri- 
ate amount of time, a normal read access ends on the cycle after WAIT is deasserted 
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Figure 10-4 PIA Write Cycle 
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(Figure 10-5), and a normal write access ends on the second cycle after WAIT is 
deasserted, to provide data hold time (Figure 10-6). If IOEXTx=1, the processor waits 
one more cycle after a read access to begin a new access, and inserts one more cycle 
of data hold time after a write access. 


10-4 Peripheral Interface Adapter 


AMD &A 


Figure 10-5 Extending a PIA Read Cycle with WAIT 
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Figure 10-6 Extending a PIA Write Cycle with WAIT 
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4 4 DMA CONTROLLER 


This chapter describes the on-chip DMA controller. Programmable registers and 
initialization are detailed, followed by a discussion of DMA transfers, queuing, and 
random DMA by external devices. 


The DMA controller supports three types of DMA transfers: internal, external, and 
direct transfers. 


11.1 OVERVIEW 


The on-chip DMA controller provides a means to transfer data between the DRAM and 
internal or external peripherals. Each supported DMA channel on the Am29200 and 
Am29205 microcontrollers is configurable for width, direction, address increment or 
decrement, external request type, and external peripheral wait states. 


Internal DMA transfers can be requested by the parallel port, serial port, and video 
interface. Each of these internal peripherals has a field in its control register for 
specifying which of the two DMA channels is to be used for the transfer. 


External DMA transfers are requested by off-chip peripherals. 


Direct DMA transfers transfer data between an external device and DRAM using an 
address supplied by the device. The GREQ and GACK signals are used to perform 
direct DMA. Direct DMA is not supported on the Am29205 microcontroller. 








The Am29200 microcontroller has two DMA channels, DMA Channel 0 and DMA | 
Channel 1, each capable of performing either internal or external DMA transfers. One of 
these channels, DMA Channel 0, supports queued transfers. Using the GREQ and 
GACK signals, the Am29200 microcontroller also supports direct DRAM and ROM 
access by an external device such as an external DMA controller. 


The Am29205 microcontroller has one externally controlled DMA channel, DMA 
Channel 1, and two internally controlled channels, DMA Channels 0 and 1, available for 
use by the internal peripherals only. 32-bit DMA transfers between internal peripherals 
and DRAM are supported. However, DMA transfers between external peripherals and 
DRAM are limited to 8- or 16- bit data accesses. The DREQO, DACKO, GREQ, GACK, 
and TDMA signals are not supported on the Am29205 microcontroller. 











11.2 PROGRAMMABLE REGISTERS 


11.2.1 DMAO Control Register (DMCTO, Address 80000030) 


The DMAO Control Register (Figure 11-1) controls DMA Channel 0 on the Am29200 
and Am29205 microcontrollers. DMA Channel 0 on the Am29205 microcontroller is 
available for transfers between internal peripherals and DRAM only; external transfers 
are not supported. 
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Figure 11-1 DMAO Control Register 
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Bit 31: DMA Extend (DMAEXT), Am29200 microcontroller—The DMAEXT bit serves 
a function very similar to the IOEXTx bits in the PIA Control registers. This bit is set to 
provide an additional cycle of output disable time for a read or an additional cycle of data 
hold time for a write. This bit is reserved on the Am29205 microcontroller. 


Bits 30—29: Reserved 


Bits 28—24: DMA Wait States (DMAWAIT), Am29200 microcontroller—trhis field 
specifies the number of wait states taken by an external access by DMA Channel 0. An 
external DMA read cycle takes at least three cycles (two wait states) and an external 
DMA write cycle takes at least four cycles (three wait states). If the DMAWAIT field 
specifies an insufficient number of wait states for an access (for example, DMAWAIT = 
00010b for a write), the processor takes the required minimum number of wait states 
instead of the specified number. This field is reserved on the Am29205 microcontroller. 


Bits 23-22: Data Width (DW)—This field indicates the width of the data transferred by 
the DMA channel, as follows: 


DW Value DMA Transfer Width 
00 32 bits timer and internal transfers on Am29200 microcontroller) 
32 bits (Internal transfers on Am29205 een 
01 8 bits 
10 16 bits 
11 32 bits, address nienangad (Reserved on Am29205 microcontroller) 


On the Am29200 microcontroller, the value DW=11 is used to repeatedly transfer a fixed 
pattern from a single DRAM location to a peripheral. For example, it can be used to 
transfer to a blank area of a printed page without requiring that a memory buffer be 
allocated for the blank area. 


Bits 21-20: DMA Request Mode (DRM), Am29200 microcontroller—This field 
indicates how external DMA requests are signaled by DREQO, as follows: © 


DRM Value . DREQO Request 
00 Active Low 
01 Active High 
10 High-to-Low transition 
11 Low-to-High transition 


The DRM field is set to 00 by a processor reset. See Section 11.3.6 for information on 
clearing latched DMA requests. This field is reserved on the Am29205 microcontroller. 
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Bit 19: Assert Chip Select (ACS), Am29200 microcontroller—This bit controls 
whether DMA Channel 0 asserts PIACSO during an external peripheral access. If the 
ACS bit is 1, the DMA channel asserts PIACSO; if the ACS bit is 0, the DMA channel 
does not assert PIACSO. This bit is reserved on the Am29205 microcontroller. 


Bits 18-10: Reserved 


Bit 9: Transfer Up/Down (UD)—This bit controls the addressing of memory for the 
series of DMA transfers. If the UD bit is 1, the DMA address (in the DMAO Address 
Register) is incremented after each transfer. If the UD bit is 0, the DMA address is 
decremented after each transfer. The amount by which the address Its incremented or 
decremented is determined by the width of the transfer, as follows: 











DW Value Address incr/Decr 


00 (82 bits) +/—4 
01 if bits) +/—1 
10 (16 bits) +/-2 
11 (32 bits) +/—O (Reserved on Am29205 microcontroller) 


Bit 8: Read/Write (RW)—This bit controls whether the DMA transfer is to or from the 
DRAM. If the RW bit is 1, the DMA channel transfers data from the DRAM to the 
peripheral. If the RW bit is 0, the DMA channel transfers data from the peripheral to the 
DRAM. | 


Bit 7: Enable (EN)—This bit enables the DMA channel to perform transfers. A 1 enables 
transfers, and a 0 disables transfers. 


Bit 6: TDMA Terminate Enable (TTE), Am29200 microcontroller—This bit, when 1, 
causes the DMA channel to sample the TDMA signal during an external DMA transfer 
and to terminate the transfer if TDMA is asserted. TDMA does not apply to an internal 
transfer. If this bit is 0, the TDMA signal is ignored. This bit is reserved on the Am29205 
microcontroller. 


Bit 5: Count Terminate Enable (CTE)—This bit, when 1, causes the DMA channel to 
terminate the transfer when the DMACNT field of the DMA Count Register decrements 
past zero. If this bit is 0, the DMA transfer does not terminate, though the DMA channel 
still decrements the count after every transfer. 


Bit 4: Queue Enable (QEN)—This bit, when 1, enables the DMA queuing feature (which 
is implemented only on DMA Channel! 0). DMA queuing allows the DMAO Address 
Register and DMAO Count Register to be reloaded automatically at the end of a DMA 
transfer from the DMAO Address Tail Register and the DMAO Count Tail Register, 
respectively. Queuing permits a second transfer to start immediately after a first transfer 
has terminated, greatly reducing the response-time requirement for software to set up 
and start the second transfer. When this bit is 0, DMA queuing is disabled, and the 
DMAO Address Register and DMAO Count Register are set directly to initiate a transfer. 


Bits 3-2: Reserved 


Bit 1: TDMA Terminate Interrupt (TTI), Am29200 microcontroller—The TTI bit is 
used to report that the DMA channel has generated an interrupt because of TDMA 
termination. If the TTE bit is one and the TDMA signal is asserted during an external 
DMA transfer, the TTI bit is set and a processor interrupt occurs. This bit is reserved on 
the Am29205 microcontroller. 
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Figure 11-2 


Bit 0: Count Terminate Interrupt (CTI)—The CTI bit is used to report that the DMA 
channel has generated an interrupt because of count termination. If the CTE bit is one 
and the DMACNT field decrements past zero, the CTI bit is set and a processor interrupt 
occurs. 


DMAO Address Register (DMADO, Address 80000034) 
The DMAO Address Register (Figure 11-2) contains the addresses for a transfer by DMA 


~ Channel 0. 
DMAO Address Register | 
31 | 23 15 7 0 





= Reserved on Am29205 microcontroller 


Bits 31-24: Peripheral Address (PERADDR), Am29200 microcontroller—This field 
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specifies eight bits that are driven on A7—AO during an external peripheral access by the 
DMA channel. A23—A8 are driven Low during the transfer. The peripheral address 
remains unchanged with each access. This field is reserved on the Am29205 microcon- 
troller, since external DMA transfers are not supported on Channel 0. 


Bits 23-0: DRAM Address (DRAMADDR)—This field contains the DRAM address for 
the next DMA transfer to or from the DRAM. The DRAMADDR field is incremented or 
decremented (based on the UD bit) by an amount determined by the width of the DMA 
transfer. The increment or decrement amount is 1 for a byte transfer, 2 for a halfword 
transfer, and 4 for a word transfer. (Word transfer is not supported onthe Am29205 
microcontroller.) To support repeated transfers from the same word on the Am29200 
microcontroller, the address can be left unchanged. The DRAMADDR field wraps from 
the value O00000h to FFFFFFh when eopemones and from FFFFFFh to 000000h 
when incremented. 


DMAO Address Tail Register (TADO, Address 80000070) — 


This write-only register (Figure 11-3) is the tail of the DMA Channel 0 address queue, 
and is used to write the address of a queued transfer when the QEN bit is 1. 


DMAO Address Tail Register 
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Bits 31-24: Reserved 


Bits 23-0: DRAM Address (DRAMADDR)—This field is written with the beginning 
DRAM address for a queued DMA transfer, if queuing is enabled. 


DMA Controller 


11.2.4 


Figure 11-4 


AMD cl 


DMAO Count Register (DMCNO, Address 80000038) 
The DMAO Count Register (Figure 11-4) specifies the number of transfers remaining to 
be performed by DMA Channel 0. 


DMAO Count Register 
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Bits 31-24: Reserved 


Bit 23-0: DMA Count (DMACNT)—This field normally specifies the number of transfers 
remaining to be performed on the DMA channel. The count is zero-based: a count of 
zero indicates one transfer, a count of one indicates two transfers, and so on. The DMA 
channel! decrements the DMACNT field after every transfer. If the CTE bit is 1, the DMA 
channel generates an interrupt when the DMACNT field is decremented past zero. 
However, if the CTE bit is not 1, the DMACNT field is still decremented after every 
transfer and can be used to determine how many transfers have been performed when 
the DMA channel terminates because of the TDMA signal. 


DMAO Count Tail Register (TCNO, Address 8000003C) 


This write-only register (Figure 11-5) is the tail of the DMA Channel 0 count queue, and 
is used to write the transfer count of a queued transfer when the QEN bit is 1. 


DMAO Count Tail Register 
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Bits 31-24: Reserved 


Bits 23-0: DMA Count (DMACNT)—This field is written with the zero-based number of 
transfers to be performed by a queued DMA transfer, if queuing is enabled. 


DMA1 Control Register (DMCT1, Address 80000040) 
The DMA1 Control Register controls DMA Channel 1. Queuing is not implemented on 
DMA Channel 1. | 
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Bit 31: DMA Extend (DMAEXT)—The DMAEXT bit serves a function very similar to the 
lIOEXTx bits in the PIA Control registers. This bit is set to provide an additional cycle of 
output disable time for a read or an additional cycle of data hold time for a write. 


Bits 30-29: Reserved 


Bits 28-24: DMA Wait States (DMAWAIT)—This field specifies the number of wait 
states taken by an external access by DMA Channel 1. An external DMA read cycle 
takes at least three cycles (two wait states) and an external DMA write cycle takes at 
least four cycles (three wait states). If the DMAWAIT field specifies an insufficient 
number of wait states for an access (for example, DMAWAIT = 00010b for a write), the 
processor takes the required minimum number of wait states instead of the specified 
number. 


Bits 23-22: Data Width (DW)—This field indicates the width of the data transferred by 


the DMA channel, as follows: 


DW Value DMA Transfer Width 
00 32 bits (External and internal transfers on Am29200 microcontroller) © 
; 32 bits (Internal transfers on Am29205 microcontroller) 
01 8 bits 
10 16 bits 
11 32 bits, address unchanged (Reserved on Am29205 microcontroller) 


On the Am29200 microcontroller, the value DW=11 is used to repeatedly transfer a fixed 
pattern from a single DRAM location to a peripheral. For example, it can be used to 
transfer to a blank area of a printed page without requiring that a memory buffer be 
allocated for the blank area. 


Bits 21-20: DMA Request Mode (DRM)—This field indicates how external DMA 


‘requests are signaled by DREQ1, as follows: 


DRM Value — DREQ1 Request 
00 Active Low _ 
01 | Active High 
10.—. ~ High-to-Low transition 
11 | Low-to-High transition 


The DRM field is set to 00 by a processor reset. See Section 11.3.6 for information on 
clearing latched DMA requests. : 


Bit 19: Assert Chip Select (ACS)—This bit controls whether DMA Channel 1 asserts 
PIACS1 during an external peripheral access. If the ACS bit is 1, the DMA channel 
asserts PIACS1; if the ACS bit is 0, the DMA channel does not assert PIACS1. 


Bits 18-10: Reserved 











Bit 9: Transfer Up/Down (UD)—This bit controls the addressing of memory for the 
series of DMA transfers. If the UD bit is 1, the DMA address (in the DMA1 Address 
Register) is incremented after each transfer. If the UD bit is 0, the DMA address is 
decremented after each transfer. The amount by which the address is incremented or 
decremented is determined by the width of the transfer, as follows: 
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DW Value Address Incr/Decr 
00 (32 bits) — +/-4 
01 (8 bits) +/—1 | 
10 (16 bits) +/-2 
11 (82 bits) +/-0 (Reserved on Am29205 microcontroller) 


Bit 8: Read/Write (RW)—This bit controls whether the DMA transfer is to or from the 
DRAM. If the RW bit is 1, the DMA channel transfers data from the DRAM to the 
peripheral. If the RW bit is 0, the DMA channel transfers data from the peripheral to the 
DRAM. | 


Bit 7: Enable (EN)—This bit enables the DMA channel to perform transfers. A 1 enables 
transfers, and a 0 disables transfers. 


Bit 6: TDMA Terminate Enable (TTE), Am29200 microcontroller—tThis bit, when 1, 
causes the DMA channel to sample the TDMA signal during an external DMA transfer 
and to terminate the transfer if TDMA is asserted. TDMA does not apply to an internal 
transfer. If this bit is 0, the TDMA signal is ignored. This bit is reserved on the Am29205 
microcontroller. | 


Bit 5: Count Terminate Enable (CTE)—This bit, when 1, causes the DMA channel to 
terminate the transfer when the DMACNT field of the DMA Count Register decrements 
past zero. If this bit is 0, the CTE field does not terminate the DMA transfer, though the 
DMA channel still decrements the count after every transfer. 


Bits 4—2: Reserved 
Bit 1: TDMA Terminate Interrupt (TTI), Am29200 microcontroller—The TTI bit is 


used to report that the DMA channel has generated an interrupt because of TDMA 


termination. If the TTE bit is one and the TDMA signal is asserted during an external 
DMA transfer, the TTI bit is set and a processor interrupt occurs. This bit is reserved on 
the Am29205 microcontroller. 


Bit 0: Count Terminate Interrupt (CTI)—The CTI bit is used to report that the DMA 
channel has generated an interrupt because of count termination. If the CTE bit is one 
and the DMACNT field decrements past zero, the CTI bit is set and a processor interrupt 
Occurs. . 


DMA1 Address Register (DMAD1, Address 80000044) 


- The DMA1 Address Register contains the addresses for a transfer by DMA Channel 1. It 


is identical in layout and definition to the DMAO Address Register, except that the 
PERADDR field is not reserved on the Am29205 microcontroller for DMA Channel 1. 


DMA1 Count Register (DMCN14, ‘Radiesse 80000048) 


The DMA1 Count Register specifies the number of transfers remaining to be performed 
by DMA Channel 1. It is identical in layout and definition to the DMAO Count Register. 


Initialization 


- The EN bits of the DMAO and DMA1 Control registers are reset to O by a processor 


reset. The DRM fields of both registers are also reset to 0 (see Section 11.3.6). The 
DMA channels must be configured by software before they are used. 
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DMA TRANSFERS 


A DMA transfer is performed as a result of a DMA request. The DMA request can be 
generated either by an internal peripheral (parallel port, serial port, or video interface) or 
by an external device using DREQ1—DREQ0 on the Am29200 microcontroller and 
DREQ1 on the Am29205 microcontroller. 


Specifying the Direction of a DMA Transfer 


The direction of a DMA transfer is determined by the RW bit of the DMA Control 
Register. 


If the RW bit is 0, the DMA channel transfers data from the peripheral to the DRAM. The 
DMA channel first performs an access to read the data from the peripheral and then 
performs a DRAM write to store the data into the DRAM. Both accesses occur won 
interruption: there is no other intervening access. 


If the RW bit is 1, the DMA channel transfers data from the DRAM to the peripheral. The 
DMA channel first performs a DRAM read to access the data and then performs an 
internal or external access to write the data to the peripheral. Both accesses occur 
without interruption: there is no other intervening access. 


Programming Internal DMA Transfers 


Programming an internal DMA transfer using the parallel port, serial port, or video 
interface involves coding the DMA controller registers along with the appropriate internal 
peripheral’s control register, as listed in the following table. 


Internal Peripheral Control Register DMA Enable Field 
Parallel Port Parallel Port Control! Register MODE. 

Serial Port Serial Port Control Register TMODE, RMODE 
Video Interface Video Control Register MODE 


Setting up an internal DMA transfer involves disabling the peripheral and the DMA 
controller, configuring both, and enabling both. Note that the internal peripheral and the 
DMA controller must be disabled before each is configured; after configuration, the 
peripheral must be enabled before the DMA controller is enabled. Otherwise, the steps 


- listed below can be performed in any order. 


The following procedure describes how to program a simple internal transfer, using the 
video interface as an example. 


1. Disable the parallel port, serial port, or video interface with the appropriate field in the 
peripheral’s control register. For example, to disable the video interlace, set the 
MODE field in the Video Control Register to 00. 


2. Disable the DMA channel by writing 0 to the EN bit of the DMAx Control Register. 


3. Program the peripheral’s control register with the appropriate values. For example, | 
program the Video Control Register with values for CLKI, SDIR, VIDI, LSI, PSI, PSIO, 
etc., as required. 


4. Program other peripheral registers as needed. For example, program the Side Matin 
Register of the video interface to set the required page margins. 


5. Program the DMAx Control Register, specifying the address increment or decrement, 


transfer direction, and interrupt enables. 
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6. Program the DMAx Address Register by specifying the DRAM starting address to be 
read or written. 


7. Program the DMAx Count Register. 


8. Enable peripheral DMA requests with the appropriate field in the peripheral’s control 
register. For example, set the MODE field of oe Video Control Register to 10 to en- 
able DMA Channel 0. 


9. Enable the DMA channel by writing a 1 to the EN bit of the DMAx Control! Register. 


Programming External DMA Transfers | 
Programming an external DMA transfer is accomplished by coding the DMA controller 


registers. Note that the DMA controller must be disabled before being configured; 
otherwise, the steps listed below can be performed in any order. 


The following procedure describes how to program a simple external transfer. 


1. Disable the DMA channel by writing a 0 to the EN bit of the DMAx Control Register. 


2. Program the DMAx Control Register, specifying the address increment or decrement, 
transfer direction, interrupt enables, wait states, etc., for external peripherals. 


3. Program the DMAx Address Register by specifying the external peripheral address 
and the DRAM starting address to be read or written. | 


4. Program the DMAx Count Register. 
5. Enable the DMA channel by writing a 1 to the EN bit of the DMAx Control Register. 


Generating External DMA Requests 


The generation of DMA requests by the DREQ1—DREQ0 signals is controlled by the 
DRM field of the DMA control register. The DMA requests can be programmed individu- 
ally to be edge- or level-sensitive for either polarity of edge or level. 


If the DMA request is edge-sensitive, the DMA request signal must remain at the __ 
appropriate level for at least four cycles after the active edge to insure that the DMA 
channel detects the request. An active edge that occurs during an in-progress transfer 
(that is, while DACKx is asserted) is ignored. The DREQx signal must be Low (rising- 
edge-triggered) or High (falling- edge-triggered) for four cycles before a new active edge 
can be recognized. 





If the DMA request is level-sensitive, the quest may be deasserted at any time while 
DACKx is asserted, and must be deasserted during the cycle in which DACKx is 
deasserted unless it is desired to generate a subsequent DMA request. 





\ 


External DMA Transfers 


External DMA transfers appear very much like PIA accesses, except the DMA acknowl- 
edge signals (DACK1—DACKO on the Am29200 microcontroller and DACK1 on the 
Am29205 microcontroller) are asserted during the transfer as well as, optionally, 
PIACS1—PIACSO on the Am29200 microcontroller and PIACS1 on the Am29205 
microcontroller. The address bus is driven with an address derived from the DMA 
Address Register. Bits 23-8 of the address are all Os, and bits 7-0 are driven with the 
PERADDR field. It is possible to use the DACKx signals as chip selects to the DMA 
peripherals. The signals PIAOE, PIAWE, and WAIT are used as they are during a PIA 
access. The DMAWAIT field is used to determine the number of wait States, much as the 
lOWAITx field is used during a PIA access. 
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On the Am29200 microcontroller, if the DRAM is 16 bits wide, a 32-bit DMA DRAM 
access appears as two 16-bit accesses on ID31-ID16. If the peripheral is 8 or 16 bits 
wide, a DMA peripheral access appears as a single access on ID7—IDO or 1D15—IDO, 
respectively. The peripheral must have the same width as the transfer. 


On the Am29205 microcontroller, DMA transfers between external peripherals and the 
DRAM are limited to 8- or 16- bit data accesses. For 8- or 16- bit wide peripherals, a 
DMA access appears on ID23—16 or ID31-—16, respectively. The peripheral must have 
the same width as the transfer. — 


Figure 11-7 shows the timing of a DMA read cycle (performed when the RW bit is 0). 
The DACKx signal (and, optionally, the PIACSx signal) is asserted in the second cycle, 
and the PIAOE signal is asserted in the third cycle. The data must be valid after the 
number of cycles determined by DMAWAIT. If DMAEXT=1, the processor waits one 
more cycle after the read access to begin a new access. The peripheral can use WAIT 
to extend the access. 














Figure 11-8 shows timing of a DMA write cycle (performed when the RW bit is 1). The 
PIAOE signal is not asserted. Instead, the processor drives data in the second cycle and 
asserts the PIAWE signal in the third cycle. The PIAWE signal is deasserted one cycle 
before the final cycle (the number of cycles is determined by DMAWAIT) to provide data 
hold time. If DMAEXT=1, the processor inserts one more cycle of data hold time after a 
write access. The peripheral can use WAIT to extend the access. 














On the Am29200 microcontroller, if the DMA channel’s TTE bit is 1, an external peripher- 
al can assert TDMA at any time while DACKx is asserted to terminate the transfer after 





Figure 11-7 External DMA PIA Read Cycle . 
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Figure 11-8 External DMA PIA Write Cycle 
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number of cycles determined by DMAWAIT+1 . 


le 
DMAWAIT=4 illustrated 


final cycle is 
extended for one 
cycle if DMAEXTx=1 


the current access; in this case, the current access is completed as usual. As with PIA 
accesses, the peripheral can use WAIT to extend the access. 





The DMA channel continues to perform transfers until the count expires or the TDMA 
input is asserted (depending on the CTE and TTE bits). When the transfer terminates, 
the EN bit is reset unless there is an active queued transfer, as explained in Section 
11.4. 


Latching External DMA Requests 
The DMA controller is designed to latch an active transition of the external DREQ line, 


even if such a transition occurs when the DMA is disabled. This latching occurs for both 


edge- and level-triggered modes. The latched transition will then be recognized when 
the DMA channel is enabled, assuming the DRM field has not changed. This latching 
avoids a problem when using edge-sensitive DMA requests. There is the potential to 
lose a request between the time a transfer terminates on the count going to zero (which 
automatically disables the channel, blocking further requests) and the time the DMA 
interrupt handler restarts the channel. 


Any programming of the DMA Control Register that changes the value of the DRM field 
from its previously programmed value will clear any latched request. Thus, to re-enable 
a DMA channel and also clear any latched request, the respective DMA Control Register 
must be written twice. With the first write, the DMA should remain disabled, and a value 
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different from the desired DRM value should be set in the DRM field. On the second 
write, the DMA should be enabled, and the desired value should be set in the DRM field. 


Upon reset, the DRM field is set to 00 (active Low). Therefore, if the DMA is later 

enabled with DRM still at 00, any active Low transition of DREQ since reset will have 
been latched and will be considered an active request when the DMA is enabled. To | 
clear any such latched request, as noted above, the DMA Control Register should be 


_ written twice, once with DMA disabled and DRM set to 11 (or 10 or 01), and finally with 


DMA enabled and DRM set to 00. 


DMA QUEUING (DMA CHANNEL 0) 


The address and count registers for DMA Channel 0 each consist of a two-entry queue, 
with each entry of the queue separately addressable for loading a new transfer. The 
DMAO Address Register and DMAO Count Register are at the head of the queue. The 


_ DMAO Address Tail Register and DMAO Count Tail Register are at the tail of the queue 


and are write-only registers. A DMA transfer queued behind an active transfer can start 
as soon as the first transfer is complete. This reduces the response-time requirement for 
software to load a new transfer: software has the entire transfer time of the second 
transfer to load the next transfer at the tail of the queue. 


DMA queuing is enabled by writing the appropriate address and count values at the 
head of the queue, then setting the DMAO Control Register appropriately, with EN=1, 
QEN=0, and CTE/TTE=1. 


A transfer is loaded into the tail of the queue by first loading the DMAO Count Tail 
Register, then loading the DMAO Address Tail Register (note that the PERADDR field 
cannot be changed by a queued transfer). Writing the tail address causes the QEN bit to 
be set. Whenever a DMA transfer terminates at the head of the queue and the QEN bit 
is 1, the transfer at the tail of the queue advances to the head of the queue and begins 
immediately. When the queued transfer advances to the head of the queue, the QEN bit 
is reset, the EN bit remains set, and the CTI/TTI bit is set (note that the automatic queue 
advance makes it impossible to inspect the count of the former transfer after a TTI 
interrupt in order to discover how many transfers were performed by that transfer). 


The CTI/TTI interrupt handler need not clear the CTI/TTI bit: in fact, it is unsafe to write 
the DMAO Control Register at this point because the termination of the current transfer 
(the transfer that was formerly queued) may be lost. The interrupt handler need only 
place the count and address of the next transfer at the tail of the queue (again, the tail 
address should be loaded after the count, because writing the tail address sets the QEN 
bit and enables the queue to advance). The CTI/TTI bit is automatically reset when the 
tail address is written. 


Queue underflow occurs if the transfer at the head of the queue terminates before the 
next transfer is loaded at the tail of the queue. Software can detect that underflow has 
occurred by examining the EN bit after setting up the next transfer. If the EN bit is 0, 
underflow has occurred, because a successful start of a queued transfer causes the EN 
bit to remain set when the termination interrupt is generated: 


RANDOM DIRECT MEMORY ACCESS BY EXTERNAL DEVICES 
(Am29200 MICROCONTROLLER) 

The Am29200 microcontroller is designed primarily for single-controller applications, and 
it has no provision for other bus masters to control the address and data buses in the 
traditional sense. However, the DMA controller does provide a mechanism for an 
external device to access the ROM or DRAM using addresses provided by the device 
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rather than by a DMA channel. External devices use the GREQ and GACK signals to 
perform a random memory access via the Am29200 microcontroller’s DRAM or ROM 
controller. 


Figure 11-9 shows the timing for a memory read using GREQ and GACK. The external — 
device indicates that it wants to perform a memory access by asserting GREQ. As soon 
as the processor can perform the access, it asserts GACK. The external device can 








place the memory address on ID31-IDO during any cycle following the assertion of 








GACK. The device indicates that the address is valid by deasserting GREQ. The 
processor uses this address to determine whether the access is to ROM or DRAM 
(according to the normal address allocation) and performs the required access. 

Figure 11-9 shows an access to DRAM, as an example. The processor deasserts GACK 
at the beginning of the cycle in which the data is valid on ID31-IDO. The deassertion of 
GACK completes the access. 


Figure 11-10 illustrates how the GREQ/GACK protocol can be used to perform a 
memory write. In this case, the external device supplies the address upon the deasser- 
tion of GREQ and then provides the write data on 1D31-IDO. The processor does not 
distinguish between a read and a write, allowing the ID Bus to be available to the device 
for the transfer of both address and data. The distinction between reads and writes must 
be made by external logic (which, for example, forms the signal we,ew in Figure 11-10) 

in a way that meets the memory timing requirements. For example, an AND gate can be 








External Random DRAM Read Cycle (Am29200 Microcontroller) 
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used to form the negative OR of the processor’s WE signal and the write enable from 
the external device. 


To summarize the use of GREQ and GACK: 


1. The external device asserts GREQ to request an access. 


2. Following the assertion of GACK, the device places the address on ID31-IDO and 
deasserts GREQ to indicate that the address is valid. 


3. Fora read, the device must be able to latch data from ID31-iDO at the end of the 
cycle in which GACK is deasserted. For a write, the device must be prepared to drive 
data on ID31-IDO on the second cycle following the address transfer and must hold 
the data valid until the cycle following the deassertion of GACK, at which time it must 
stop driving. The device must also supply a write enable signal that satisfies the tim- 
ing requirements of the memory. In either case, the processor deasserts GACK 
based on the access timing of the ROM or DRAM. 


To further clarify the use of GREQ and GACK, Figure 11-11 shows example timing for a 
ROM read. Writes to the ROM space are more difficult to implement than DRAM writes 
because the processor always asserts the ROMOE signal. 














Memory accesses using GREQ and GACK are restricted to 32-bit accesses: 8- and 
16-bit accesses are not supported. Zero-wait-state accesses are also not supported. 


Furthermore, the ROM and/or DRAM bank must be 32 bits wide. Although the GREQ/ 
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Figure 11-11 External Random ROM Read Cycle (Am29200 Microcontroller) 
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GACK protocol supports full 32-bit addressing, the addresses supplied must be within 
the range of ROM or DRAM addresses. DRAM mapping cannot be performed. 


During a processor reset, the GREQ input may be used by a hardware-development 

- system to force processor outputs to the high-impedance state. To prevent driver 
conflicts, the system should keep GREQ in a high-impedance state during a processor 
reset. 
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This chapter discusses the programmable 1I/O port available on the Am29200 and 
Am29205 microcontrollers. Eogminee registers, initialization, and operation are 
described. 


12.1 OVERVIEW 


The I/O port permits direct programmable access of up to sixteen external PIO signals, 

as either inputs or open-drain outputs. When used as inputs, eight of these signals, 

PIO15—PIO8, can be programmed to cause edge- or level-sensitive interrupts. The 

Am29200 microcontroller supports sixteen external PIO signals (PIO15—PIO0). The 
~Am29205 microcontroller supports eight PIO signals, (PIO15—PIO8). 


12.2 PROGRAMMABLE REGISTERS 


12.2.1 PIO Control Register (POCT, Address 800000D0) 


The PIO Control Register (Figure 12-1) controls interrupt generation and determines the 
polarity of PIO15—PIOO on the Am29200 microcontroller and PIO15—PIO8 on the 
Am29205 microcontroller. 


Figure 12-1 PIO Control eae 





.] = Reserved on Am29205 microcontroller 


Bits 31-30: Interrupt Request Mode, PIlO15 (IRM15)}—This field enables PIO15 to 
generate an interrupt equivalent to a request on the processor’s INTR@ input, and 
indicates whether PIO15 is level- or edge-sensitive in generating the IGOR: The 
IRM15 field controls PIO15 as follows: 





{RM15 Value PIO15 Interrupt 
00 Interrupt disabled 
01 - Level-sensitive 
10 : Edge-sensitive 
11 IRM15 only — see below | 


The INVERT field (see below) further conditions interrupt generation. If the INVERT bit 
for PIO15 is 0, an interrupt, if enabled, is generated by a High level on PIO15 (level-sen- 
sitive) or on a Low-to-High transition (edge-sensitive) of PIO15. If the INVERT bit for 
P1015 is 1, an interrupt, if enabled, is generated by a Low level on PIO15 (level-sensi- 
tive) or on a High-to-Low transition (edge-sensitive) of PIO15. 
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For IRM15, the value 11 causes PIO15 to generate an edge-triggered interrupt and to 
also set the FBUSY bit in the Parallel Port Control Register (see Section 13.2.1), causing 
the PBUSY output to be asserted. This can be used to support certain system-specific 
features of the parallel port. Note that this value may cause a spurious eetung of FBUSY 
during a reset, depending on the activity on PIO15 after a reset. 


Bits 29-16: IRM14 through IRM8—The IRM14—IRM8 fields enable interrupts and 
specify level- or edge-sensitivity for PIO14—PlO8, respectively. These fields are identical 
in definition to IRM15, except that the value 11 is reserved. 


Bits 15—0: PIO Inversion (INVERT)—This field determines how the level on each PIO 
signal is reflected in the PIO Input and PIO Output Registers, and how interrupts are 
generated. The most significant bit of the INVERT field determines the sense of PIO15, 
the next bit determines the sense of PIO14, and so on. A 0 in this field causes the internal 
and external sense of the respective PIO signal to be noninverted; a High external level is 
reflected as a 1 internally, and a Low is reflected as a 0 internally. A 1 in this field causes 
the internal and external sense of the respective PIO signal to be inverted; a High external 


level is reflected as a 0 internally, and a Low is reflected as a 1 internally. 


Bits 7-0: Reserved. These bits are reserved on the Am29205 microcontroller and 
should be written with Os to ensure compatibility. 


PiO Input Register (PIN, Address 800000D4) 
The PIO Input Register (Figure 12-2) reflects the external levels of PIO1 5-PIOO on the 
Am29200 microcontroller and PIO15—PIO8 on the Am29205 microcontroller. 


PIO Input Register 
31 23 15 , 7 0 


reserved 


= Reserved on Am29205 microcontroller 





Bits 31-16: Reserved 


Bits 15-0: PIO Input (PIN)—This field reflects the levels on each PIO signal. The most 
significant bit of the PIN field reflects the level on PIO15, the next bit reflects the level on 
PlO14, and so on. The correspondence between levels and bits in this register is | 
controlled by the INVERT field. 


Bits 7-0: Reserved. These bits are reserved on the Am29205 microcontroller and will 
be read as Os. 


PIO Output Register (POUT, Address 800000D8) 


~ The PIO Output Register (Figure 12-3) determines the levels driven on the PIO signals, 


for those signals enabled to be driven by the PIO Output Enable Register. 


PIO Output Register . 
31 23 15 7 0 





; reserved 


= Reserved on Am29205 microcontroller 
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Bits 31-16: Reserved 


Bits 15-0: PIO Output (POUT)—This field determines the levels on each PIO signal, if 
so enabled by the PIO Output Enable Register. The most significant bit of the POUT field 
determines the level on PIO15, the next bit determines the level on PIO14, and so on. 
The correspondence between levels and bits in this register is controlled by the INVERT 
field. 


Bits 7-0: Reserved. These bits are reserved on the Am29205 microcontroller and 
should be written with Os to ensure compatibility. 


PIO Output Enable Register (POEN, Address 800000DC) 


The PIO Output Enable Register (Figure 12-4) determines whether or not the PIO 
signals are driven as outputs. 


PIO Output Enable Register 





31 23 15 


reserved 


= Reserved on Am29205 microcontroller 








Bits 31-16: Reserved 


Bits 15—0: PIO Output Enable (POEN)—This field determines whether each PIO signal 
is driven as an output. The most significant bit of the POEN field determines whether _ 
PIO15 is driven, the next bit determines whether PIO14 is driven, and so on. A 1 in a bit 
position enables the respective signal to be driven according to the associated POUT 
and INVERT bits, and a 0 disables the signal as an output. 


Bits 7-0: Reserved. These bits are reserved on the Am29205 microcontroller and 
should be written with Os to ensure compatibility. . 


Initialization 

During a processor reset, all bits of the PIO Output Enable Register are reset to 0, 
disabling all PIO signals as outputs. The I/O port must be initialized by software before 
the I/O port is used. 


OPERATING THE I/O PORT 


The PIO signals are asynchronous to the processor. A change on any PIO signal is 
reflected in the PIO Input Register a maximum of four MEMCLK cycles after the change 
occurs. A level-sensitive interrupt occurs four cycles after the change, and an edge-sen- 
sitive interrupt occurs five cycles after the change. When driven as an output, a change 
to the PIO Output Register is reflected on the PIO signals a maximum of one cycle after 
the change occurs. All the PIO signals have additional metastable hardening, allowing 
them to be driven with slow-transition-time signals. 


The PIO Output Enable Register permits the PIO signals to be operated as open-drain 
outputs. This is accomplished by keeping the appropriate POUT bits constant and 
writing data into the POEN field, so the output is either driving Low or is disabled, 
depending on the data. | 
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Figure 13-1 


This chapter describes the parallel port supported on the Am29200 and Am29205 
microcontrollers. Programmable registers and initialization are described, along with 
parallel port transfers from the host and to the host. 


OVERVIEW 


The parallel port supports asynchronous bidirectional parallel data transfers. It connects a 
host processor to the Am29200 or Am29205 microcontroller. The parallel port supports 
data transfers from the host to the microcontroller or from the microcontroller to the host. 
Data is transferred via an 8- or 32-bit external data register. Data is transferred to and 
from the external data register via processor access (programmed I/O) or DMA transfers. 
The Am29205 microcontroller does not support full word transfers on the parallel port. 


PROGRAMMABLE REGISTERS 


Parallel Port Control Register (PPCT, Address 800000CO) 
The Parallel Port Control Register (Figure 13-1) controls the parallel port. 


Parallel Port Control Register 
23 - 15 7 0 


res ! DRQ! DDIR 'FACK! BRS! AFD 
FWT _ TRA FBUSY DHH ARB 


31 





= Reserved on Am29205 microcontroller 


Bit 31: Reserved 
Bit 30: Full Word Transfer (FWT), Am29200 microcontroller—This bit controls 


_ whether the parallel port generates an interrupt or DMA request every handshake or 


every fourth handshake. When FWT is 0, a transfer of either 8 or 32 bits occurs on every 
handshake. When FWT is 1, a transfer of either 8 or 32 bits occurs.on every fourth 
handshake, reducing the demand the parallel port places on the processor. The FWT bit 
is reserved on the Am29205 microcontroller; it must be set to 0 to ensure proper 
operation. 


_ For proper transfer of data, external logic must assemble bytes from the parallel port | 


interface into an 8- or 32-bit external latch that implements the Parallel Port Data Register. 
The actual size of the transfer is determined by the width of the external latch: an 8-bit 
latch enables 8-bit transfers, and a 32-bit latch enables full word transfers. The DMA 
transfer or load/store instruction that reads/writes the Parallel Port Data Register must 
indicate the correct data width. Full word transfers are implemented only for transfers from 
the host. Full word transfers are not supported on the Am29205 microcontroller. 
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Bits 29-24: Reserved 


Bits 23-16: Transfer Delay (TDELAY)—During a transfer from the host, this field 
controls the duration of the assertion of PACK (and possibly PBUSY). During a transfer 
to the host, it controls the duration of data setup, PACK assertion, and data hold times. 





On transfers from the host, the TDELAY field specifies one less than the number of 
MEMCLK cycles in the duration interval. Setting TDELAY to 0 in this case will cause 
PACK to assert for one cycle. 


On transfers to the host, the TDELAY field species the number of MEMCLK cycles in 
the duration interval. In this case, if TDELAY is set to 0, PACK will not assert at alll. 


Bit 15: Data Request (DRQ)—This bit is set to indicate that the parallel port is ready for 
data to be read from or written to the Parallel Port Data Register. If so enabled by the 
MODE field, this bit being 1 generates an interrupt or DMA request to read or write data. 
This bit is reset when the Parallel Port Data Register is read or written. The DRQ bit is 


_ read-only, allowing other bits of the Parallel Port Control Register to be set (for example, | 


the FACK bit) without interfering with the data request. 


Bit 14: Transfer Active (TRA)—This bit is set at the beginning of a transfer on the 
parallel port and reset at the end of a transfer. It is read-only, so that setting other bits of 
the Parallel Port Control Register does not interfere with the indication of an active 
request. The TRA bit can be inspected by software to detect that a transfer is hung. 


Bits 13-11: Reserved 


Bit 10: Data Direction (DDIR)—This bit controls the direction of data transfer on the 
parallel port. If the DDIR bit is 0 (the default), data is received on the parallel port. If the 


DDIR bit is 1, data is transmitted on the parallel port. The MODE field must be 00 when 


the DDIR bit is changed. 


Bits 9-8: Parallel Port Mode (MODE)—This field enables the parallel port and controls 
the operational mode of the parallel port, as follows: 


MODE Value Effect on Parallel Port 
00 | Disabled 
01 Generate interrupt requests for service 
10 | Generate DMA Channel 0 requests 
11 Generate DMA Channel 1 requests 


_ Requests for service are requests to read or write the Parallel Port Data Register. 


Placing the parallel port into the disabled state causes all internal state machines to be 
reset, forces PACK Low, and holds the parallel port in an idle state. Parallel port | 
programmable registers are not affected when the port is disabled. 


Bit 7: Force Busy (FBUSY)—A 1 in this bit forces an active level on the PBUSY output. 
A 0 allows the PBUSY signal to operate normally. 


Bit 6: Force ACK (FACK)—A 1 in this bit forces an active level on the PACK output for 
one TDELAY interval. At the end of the interval, the FACK bit is reset and PACK is 
deasserted. 


Bit 5: Disable Hardware Handshake (DHH)—A 1 in this bit prevents the parallel 
port interface logic from controlling PACK or PBUSY. A 0 in this bit permits normal 
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handshaking with PACK and PBUSY. FACK and FBUSY may be used by software to 
control PACK and PBUSY regardless of the DHH bit. 


Bits 4—3: Reserved 


Bit 2: BUSY Relationship to STROBE (BRS)—This bit controls the relative timing of 
the PBUSY and PSTROBE hardware handshaking when the parallel port is receiving 
data. If BRS=0, PBUSY is asserted on the Low-to-High transition (leading edge) of 
PSTROBE. If BRS=1, PBUSY is asserted on the High-to-Low transition (trailing edge) of 
PSTROBE. The parallel port does not respond to PSTROBE until PBUSY is asserted, 
except that the TRA bit is always set on the leading edge of PSTROBE. 


Bit 1: ACK Relationship to BUSY (ARB)—This bit controls the relative timing of the 
PACK and PBUSY handshaking when the parallel port is receiving data. 

















lf ARB=0, PBUSY and PACK are asserted and deasserted at the same time (except for 
output driver skew). Both PACK and PBUSY are asserted at either the leading or trailing 
edge of PSTROBE, as controlled by the BRS bit. Both are deasserted together at the 
end of a transfer, which is usually at the end of a TDELAY interval. 





lf ARB=1, the PACK pulse follows the PBUSY pulse in time. PBUSY is asserted in 
response to an assertion of PSTROBE and is deasserted when the Parallel Port Data 
Register has been read and PSTROBE is Low. PACK is asserted at the same time 
PBUSY is deasserted and is deasserted at the end of a TDELAY interval. 


Bit 0: Autofeed (AFD)—This bit reflects the level on the PAUTOFD input. A 1 indicates 
PAUTOFD is active (High), and a 0 indicates PAUTOFD is inactive (Low). : 





Parallel Port Status Register (PPST, Address 800000C8) | 
The Parallel Port Status Register (Figure 13-2) indicates the status of the parallel port. 


Parallel Port Status Register 


31 23 
STB | ACK 


BSY 


Bit 31: PSTROBE Level (STB) This bit indicates the level on the PSTROBE signal. If 
PSTROBE is Low, this bit is 0; if PSTROBE is High, this bit is 1. 


Bits 30-24: Reserved 


Bits 23-16: TDELAY Counter Value (T DELAYV)—This field indicates the current value 
of the TDELAY counter used to time transitions of the handshaking signals. This value 
changes as the TDELAY interval is being timed. 


Bits 15—10: Reserved 


Bits 9-8: Byte Count (BCT)—When the FWT bit is 1, this field indicates the number of 
bytes (that is, the number of complete handshakes) received on the parallel port since 
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8 bits 


32 bits 
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the most recent data request. This information is useful for handling partial-word 
transfers at the end of a block transfer. 


Bit 7: PBUSY Level (BSY)—This bit indicates the level on the PBUSY signal. lf PBUSY 
is Low, this bit is 0; if PBUSY is High, this bit is 1. 


Bit 6: PACK Level (ACK)—This bit indicates the level on the PACK signal. If PACK is 
Low, this bit is 0; if PACK is High, this bit is 1. 





| Bits 5—0: Reserved 


Parallel Port Data Register (PPDT, Address 800000C4) 


The Parallel Port Data Register (Figure 13-3) is used to read from and write data to the 
parallel port. This register is not implemented directly on the processor, but rather is 
implemented by an external data latch connected to the parallel port interface cable. The 
processor converts an access of this register into an external access of the data latch. 


This access is similar to a PIA access, except the timing is fixed (see Section 13.3) and 





the access uses the signals POE and PWE to read and write the latch. 


Parallel Port Data Register 


31 23 15 7 0 
31 23 15 : 7 | 0 


PDATA 


Bits 7-0 (8-bit transfers) or 


Bits 31-0 (32-bit transfers): Parallel Port Data (PDATA), Am29200 microcontroller— 
This field contains the data being transferred to/from the microcontroller and the host 
over the parallel port. For transfers from the host, the width of this field is determined by 
the width of the external latch that implements the Parallel Port Data Register. However, 
the instruction or DMA channel that reads the parallel! port must also specify the correct 
data width to properly read the Parallel Port Data Register. Full word transfers are not 
supported on the Am29205 microcontroller. 


Initialization 
During a processor reset, the MODE field of the Parallel Port Control Register is reset to 


00 (disabled) and the FBUSY bit is set to 1, forcing PBUSY Low (busy). The parallel port 
must be configured by software before the parallel port is enabled. 





Writing the value 00 into the MODE field resets the parallel port, forces PACK Low, and 
forces PBUSY High (unless FBUSY is set). 





The I/O port signal PlIO15 may be used by the host to signal a change in the configura- 
tion of the parallel port. If the IRM15 field of the PIO Control Register has the value 11 
(see Section 12.2.1), PIO15 causes an edge-triggered interrupt and causes the FBUSY 
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bit to be set. Setting the FBUSY bit causes the parallel port to appear busy (PBUSY=0) 
to the host while the port’s configuration is changed. The FBUSY bit must be reset by 
software (if required) once configuration is complete. 


PARALLEL PORT TRANSFERS 


The parallel port does not attach directly to the microcontroller, but is attached to the . 
interface cable via buffers. Data must be latched in the interface using a three-state latch 
such as a 74L$374. The handshaking signals, PSTROBE, PAUTOFD, PACK, and 
PBUSY, are connected to the microcontroller via simple interface circuits. The inputs 





_PSTROBE and PAUTOFD should be connected to the processor via a Schmitt-trigger 


inverter such as a 74HCT14, and the outputs PACK and PBUSY should be connected to 
the host via an open-collector inverter such as a 7406. 


The hardware handshaking described in this section can be disabled by setting the DHH 
bit. If the DHH bit is 1, handshaking can be accomplished by software using the FACK 
and FBUSY bits. 


Transfers from the Host 


Figure 13-4 shows the state-transition diagram for transferring data from the host to the 
microcontroller over the parallel port. Figure 13-5 through Figure 13-8 show the timing 
diagrams for these transfers. The timing diagrams differ in the settings of the BRS and 
ARB bits. The timing diagrams also show the signals as they appear at the processor 
interface, and do not reflect the inversions in the buffers to the parallel-port connector. 


The host begins the transfer by placing data on the interface and asserting the 
PSTROBE signal. The data is latched in the interface on the rising edge of PSTROBE if 
BRS=0, and can be latched by either cee if BRS=1. The TRA bit is set on the leading 
edge of PSTROBE. 


The microcontroller asserts PBUSY within three MEMCLK cycles after the leading edge 
of PSTROBE (BRS=0) or within three MEMCLK cycles after the trailing edge of . 
PSTROBE (BRS=1). The microcontroller asserts PACK at the same time as PBUSY if 
ARB=0. The parallel port then generates either an interrupt request or a DMA request, 
as controlled by the MODE field, so the data can be read. If ARB=0, both PBUSY and 
PACK are deasserted once the TDELAY interval has expired, the Parallel Port Data 
Register (PDR) has been read, and the host has deasserted PSTROBE. If ARB=1, 
PBUSY is deasserted and PACK is asserted when the PDR has been read and 
PSTROBE is Low. PACK remains active until the TDELAY interval has expired. In any 
case, the TRA bit is reset when PACK is deasserted. 


The PDR is mapped to the external buffer register. Figure 13-9 shows the timing of the 
external access. This external access is treated as either a DMA access or a processor 
PIA access for the purpose of prioritization with other accesses. 


The PAUTOFD signal is used for software control during a transfer from the host. 
Software can detect the level on PAUTOFD by reading the AFD bit in the Parallel Port 
Control Register. 


Transfers to the Host 


Figure 13-10 shows the state transition diagram for transferring data from the microcon- 
troller to the host over the parallel port. Figure 13-11 shows the timing for this transfer. 
Transfers to the host are enabled by the host, using a system-dependent software 
protocol. This type of transfer is enabled in the processor by setting the DDIR bit in the 
Parallel Port Control Register. Setting the DDIR bit forces the PBUSY output active, 
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*PIO or DMA request is generated every 
fourth time if FWT=1 


preventing the host from transferring data to the microcontroller. The MODE bit must be 
00 when the DDIR bit is set or reset. 


The microcontroller begins the transfer by writing data to the external buffer. 


Figure 13-12 shows the timing for a buffer write. The buffer is written by either software 


_writing the Parallel Port Data Register or a DMA transfer that writes the Parallel Port — 


Data Register. The parallel port automatically generates the first DMA or interrupt 
request to write the data. Thereafter, the parallel port oe a DMA or lil 
request after it completes each transfer to the host. 


During a transfer to the host, the PAUTOFD signal is used to indicate that the host is 
busy and cannot accept data. PAUTOFD has the same polarity as PBUSY for this 
purpose. After the data buffer has been written, the parallel port waits for one TDELAY 
interval and then asserts PACK as soon as PAUTOFD is High and PSTROBE is Low 
(these signal conditions may hold before the interval expires). The TDELAY interval is 
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Figure 13-5 Transfer from the Host on the Parallel Port (BRS=0, ARB=0) 
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Figure 13-6 Transfer from the Host on the Parallel Port (BRS=0, ARB=1) 
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used to provide data setup time for the host. PACK is active for one TDELAY interval, 
then is deasserted. 


In response to PACK, the host acknowledges the transfer by asserting PSTROBE, which 
resets the TRA bit. PSTROBE has no fixed relationship to PACK. The host may also — 
assert PAUTOFD before the end of the transfer to indicate it is not ready for a subse- 
quent transfer. Following the deassertion of PACK or the assertion of PSTROBE 
(whichever is later), the parallel port waits one TDELAY interval to provide data hold time 
to the host. At the end of the interval, the parallel port generates a new DMA or interrupt 
request to have the data buffer written again, starting a new transfer. Software or the 
DMA channel may determine that all transfers have been made, and a new transfer 
does not start in this case. 
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Figure 13-8 Transfer from the Host on the Parallel Port (BRS=1, ARB=1) 
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Figure 13-10 State Transitions for Transfers to the Host 
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Figure 13-11 Transfer to the Host on the Parallel Port 
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This chapter describes the programmable registers of the serial port on the Am29200 
and Am29205 microcontrollers. 





14.1 OVERVIEW 


The on-chip serial port is a UART that permits full-duplex, bidirectional data transfer using 
the RS-232 standard. Serial port registers provide a programmable baud rate generator, 
odd/even parity capability, choice of word length, a test mode, and DMA access. 


The operations of the serial port are similar on the Am29200 and Am29205 microcontrol- 
lers, except that the DSR and DTR handshake signals are not available on the Am29205 
microcontroller. These functions, if needed, can be recreated with available PIO signals. 


14.2 PROGRAMMABLE REGISTERS 


14.2.1 Serial Port Control Register (SPCT, Address 80000080) 


The Serial Port Control Register (Figure 14-1) controls both the transmit and receive 
sections of the serial port. 


Figure 14-1 Serial Port Control Register 


31 : 23 15 7 0 
STP 


RSIE 





'BRK ! 
LOOP DSR © 


= Reserved on Am29205 microcontroller . 


Bits 31-27: Reserved 


Bit 26: Loopback (LOOP)—Setting this bit places the serial port in the loopback mode. In 
this mode, the TXD output is set High and the Transmit Shift Register is connected to the 
Receive Shift Register. Data transmitted by the transmit section is immediately received 
by the receive section. The loopback mode is provided for testing the serial port. 


Bit 25: Send Break (BRK)—Setting this bit causes the serial port to send a break, 
which is a continuous Low level on the TXD output for a duration of more than one frame | 
transmission time. The transmitter can be used to time the frame by setting the BRK bit 
when the transmitter is empty (indicated by the TEMT bit of the Serial Port Status 

- Register), writing the Serial Port Transmit Holding Register with data to be transmitted, 
and then waiting until the TEMT bit is set again before resetting the BRK bit. 


Bit 24: Data Set Ready (DSR), Am29200 microcontroller—Setting this bit causes the 
DSR output to be asserted. Resetting this bit causes the DSR output to be deasserted. 
This bit is reserved on the Am29205 microcontroller. 
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Bits 23-22: Reserved 


Bits 21-19: Parity Mode (PMODE)—This field specifies how parity generation and 
checking are performed during transmission and reception (the value “x” is a don’t care): 


PMODE Value Parity Generation and Checking 
Oxx gd No parity bit in frame 
100 Odd parity (odd number of is in ari) 
101 Even parity (even number of 1s in frame) 
110 Parity forced/checked as 1 


111 Parity forced/checked as 0 


Bit 18: Stop Bits (STP)—A 0 in this bit specifies that one stop bit is used to signify 
the end of a frame. A 1 in this bit specifies that two stop bits are used to signify the end 
of a frame. 


Bits 17-16: Word Length (WLGN)—This field indicates the number of data bits 
transmitted or received in a frame, as follows: 


WLGN Value Word Length 
00 5 bits 
01 6 bits 
10 7 bits 
11 | 8 bits 


Data words of less than eight bits are right-justified in the Transmit Holding Register and 
Receive Buffer Register. | . ; 


Bits 15—10: Reserved 


Bits 9-8: Transmit Mode (TMODE)—This field enables data transmission and controls 
the operational mode of the serial port for the transmission of data, as follows: 


TMODE Value Effect on Transmit Section 


00 Disabled | 

01 Generate interrupt requests for service 
10 Generate DMA Channel 0 requests 

11 Generate DMA Channel 1 requests 


Requests for service are requests to write the Transmit Holding Register with data to be 
transmitted. Placing the transmit section into the disabled state causes all internal state | 
machines to be reset and holds the transmit section in an idle state with TXD High. 
Serial port programmable registers are not affected when the transmit section is 
disabled. : 


Bits 7-3: Reserved 


Bit 2: Receive Status Interrupt Enable (RSIE)—This bit enables the serial port to 
generate an interrupt because of an exception during reception. If this bit is 1 and the 
serial port receives a break or experiences a framing error, parity error, or overrun error, 
the serial port generates a Receive Status interrupt. 
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Bits 1-0: Receive Mode (RMODE)—This field enables data reception and controls the 
operational mode of the serial port for the reception of data: 


RMODE Value | Effect on Receive Section 
00 Disabled 
01 Generate interrupt requests for service 
10 Generate DMA Channel 0 requests 
11 Generate DMA Channel 1 requests 


Requests for service are requests to read data from the Receive Buffer Register. Placing 
the receive section into the disabled state causes all internal state machines to be reset 
and holds the receive section in an idle state. Serial port programmable registers are not 
affected when the receive section is disabled. 


Serial Port Status Register (SPST, Address 80000084) 
The Serial Port Status Register (Figure 14- ae indicates the status of the transmit and 


‘receive sections of the serial port. 


Serial Port Status Register 





| ' § t 
‘THRE! ‘BRKI' PER! 
TEMT RDR DTR FER OER 


= Reserved on Am29205 microcontroller 


Bits 31-11: Reserved 


Bit 10: Transmitter Empty (TEMT)—This bit is 1 when the transmitter has no data to 
transmit and the Transmit Shift Register is mPUE This indicates to software it is safe to 
disable the transmit section. 


Bit 9: Transmit Holding Register Empty (THRE)—When the THRE bit is 1, the 
Transmit Holding Register does not contain valid data and can be written with data to be 
transmitted. When the THRE bit is 0, the Transmit Holding Register contains valid data 
not yet copied to the Transmit Shift Register for transmission and cannot be written. If so 
enabled by the TMODE field, the THRE bit causes an interrupt or DMA request when it 
is set. The THRE bit is reset automatically by writing the Transmit Holding Register. This 
bit is read-only, allowing other bits of the Serial Port Status Register to be written (for 
example, resetting the BRKI bit) without interfering with the data request. 


Bit 8: Receive Data Ready (RDR)—When the RDR bit is 1, the Receive Buffer Register 
contains data that has been received on the serial port, and can be read to obtain the 
data. When the RDR bit is 0, the Receive Buffer Register does not contain valid data. If 
so enabled by the RMODE field, the RDR bit causes an interrupt or DMA request when it 
is set. The RDR bit is reset automatically by reading the Receive Buffer Register. 


. Bits 7-5: Reserved 
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Bit 4: Data Terminal Ready (DTR), Am29200 microcontroller—The DTR bit indicates 
the level on the DTR pin. The DTR bit is 1 when the DTR pin is active; the DTR bit is 0 
when the DTR pin is inactive. This bit is reserved on the Am29205 microcontroller. 


Bit 3: Break Interrupt (BRKI)—The BRKI bit is set to indicate that a break has been 
received. If the RSIE bit is 1, the BRKI bit being set causes a Receive Status interrupt. 
The BRKI bit should be reset by the Receive Status interrupt handler. 


Bit 2: Framing Error (FER)—This bit is set to indicate that a framing error occurred 
during reception of data. If the RSIE bit is 1, the FER bit being set causes a Receive 


. Status interrupt. The FER bit should be reset by the Receive Status interrupt handler. 


Bit 1: Parity Error (PER)—This bit is set to indicate that a parity error occurred during 
reception of data. If the RSIE bit is-1, the PER bit being set causes a Receive Status 
interrupt. The PER bit should be reset by the Receive Status interrupt handler. 


Bit 0: Overrun Error (OER)—This bit is set to indicate that an overrun error occurred 
during reception of data. If the RSIE bit is 1, the OER bit being set causes a Receive 
Status interrupt. The OER bit should be reset by the Receive Status interrupt handler. 


Serial Port Transmit Holding Register (SPTH, Address 80000088) 
The processor writes this register (Figure 14-3) with data to be transmitted on the serial 


port. The transmitter is double-buffered, and the transmit section copies data from the 


Transmit Holding Register to the Transmit Shift Register (which is not accessible to 
software) before transmitting the data. 


Serial Port Transmit Holding Register 
31 23 15 


7 0 


Bits 31-8: Reserved 


Bits 7-0: Transmit Data (TDATA)—This field is written with data to be transmitted on 
the serial port. The THRE bit of the Serial Port Status Register should be 1 when this 
register is written, to avoid overwriting data already in the register. Writing this register 
causes the THRE bit to be reset. 


Serial Port Receive Buffer Register (SPRB, Address 8000008C) 


This register (Figure 14-4) contains data received over the serial port. The receiver is 
double-buffered, and the receive section can be receiving a subsequent frame of data in 
the Receive Shift Register (which is not accessible to software) while the Receive Buffer 


__ is being read by software or by a DMA channel. 


Serial Port Receive Buffer Register 
31 23 15 | a 2 0 


Serial Port 


14.2.5 | 


Figure 14-5 


AMD cl 


Bits 31-8: Reserved 


Bits 7-0: Receive Data (RDATA)—This field contains data received on the serial port. 
The RDR bit of the Serial Port Status Register should be 1 when this register is read, to 
avoid reading invalid data. Reading this register causes the RDR bit to be reset. 


Baud Rate Divisor Register (BAUD, Address 80000090) 

This register (Figure 14-5) specifies a clock divisor for the generation of a serial clock 
that controls the serial port. The UCLK (serial clock rate) is 16 times faster than the baud 
rate of the serial port. The Baud Rate Divisor Register specifies the zero-based number 
of UCLK cycles in one phase (half period) of the 16x serial clock. The formula for the 
baud rate is thus: 


Baud Rate = (Frequency of UCLK) + (BAUDDIV+1) + 32 


The maximum baud rate is 1/32 of INCLK and is achieved by tying UCLK to INCLK with 
BAUDDIV=0000; hexadecimal. — 


Baud Rate Divisor Register 
31 | 23 15 7 0 


14.2.6 


Bits 31-16: Reserved 


Bit 15—1: Baud Rate Divisor (BAUDDIV)—This field specifies the amount by which the 
UCLK input is divided to generate one phase of the serial clock. The serial clock | 
operates at 16 times the rate of transmission or reception of data. The BAUDDIV value 
is zero-based. For example, a value of two specifies a divisor of three. 


Initialization 


During a processor reset, both the TMODE and RMODE fields of the Serial Port Control 
Register are reset to 00, disabling the transmit and receive sections of the serial port. 
Software must initialize the serial port before it is enabled. 
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This chapter describes the bidirectional bit serializer/deserializer (known as the video 
interface) on the Am29200 and Am29205 microcontrollers. First the programmable 
registers of the video interface are described. This is followed by a discussion of video 
interface operation, including transmitting and receiving data. 


OVERVIEW 


The video interface provides direct connection to printer engine interfaces, raster input 
devices, and other serial-driven devices. The programmable interface allows direct 
connection to a large number of marking engines. Features of the interface include 
programmable image scan rates, along with programmable horizontal and vertical image 
margins. Video interface operations are the same on both the Am29200 and Am29205 
microcontrollers. 


PROGRAMMABLE REGISTERS 


Video Control Register (VCT, Address 800000E0) 
This register (see Figure 15-1) controls the operation of the video interface. 


Video Control Register 


31 23 15 . 7 0 


DRQ DDIR CLKI} PSIO! PSL { SDIR} 
res PS! LSI VIDI 


Bits 31-16: Reserved 


Bit 15: Data Request (DRQ)—This bit is set to indicate that the video interface is ready 
for data to be written to or read from the Video Data Holding Register. If so enabled by ~ 
the MODE field, this bit being set generates an interrupt or DMA request to write or read 
data. This bit is reset when the Video Data Holding Register is read or written. This bit is 
read-only, to allow other bits of the Video Control Register to be set (for example, the 
PSL bit) without interfering with the data request. 


Bits 14-11: Clock Divide (CLKDIV)—This field contains the divisor of the VCLK input 


_ used to generate the internal video clock. It specifies the number of VCLK periods in one 


phase (half period) of the internal video clock. For example, a value of 0001 indicates 
that one VCLK period constitutes one phase of the internal video clock—a divide by two. 
A value of 0000 causes VCLK to be used directly as the video clock. At the beginning of 
a video raster line, the clock divider is initialized so that, in the line, the first period of the 
internal clock is the correct number of VCLK periods. 
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Bit 10: Data Direction (DDIR)—This bit controls the direction of video data. If the DDIR 
bit is O, data is transmitted on the video interface. If the DDIR bit is 1, data is received on 
the video interface. 


Bits 9-8: Video Interface Mode (MODE)—This field enables the video interface and 
controls the operational mode of the video interface, as follows: 


MODE Value Effect on Video Interface 
00 Disabled 
01 Generate interrupt requests for service 
10 Generate DMA Channel 0 requests 
11 | Generate DMA Channel 1 requests 


Requests for service are requests to read or write the Video Data Holding Register. 
Placing the video interface into the disabled state causes all internal state machines to 
be reset and holds the video interface in an idle state. Video interface programmable 
registers are not affected when the interface is disabled. 


Bit 7: Clock Invert (CLKI)—If this bit is 0, the VDAT, PSYNC, and LSYNC pins are 
driven or sampled on the Low-to-High transition of the VCLK input. If this bit is 1, the 
VDAT, PSYNC, and LSYNC pins are driven or sampled on the High-to-Low transition of 
the VCLK input. 


Bit 6: Reserved 


Bit 5: Page Sync Input/Output (PSIO)—This bit determines whether or not PSYNC is 
an input or output. If this bit is 0, PSYNC is an input. If this bit is 1, PSYNC is an output. 


Bit 4: Page Sync Invert (PSI)—Ilf this bit is 0 and PSYNC is an input, a Low-to-High 
transition of the PSYNC input indicates the beginning of a page. If this bit is 1 and 
PSYNC is an input, a High-to-Low transition of the PSYNC my indicates the beginning 
ofa page. 


If this bit is O and PSYNC is an output, PSYNC i is noninverted with respect to the PSL 
bit. A PSL bit of 0 is reflected as a Low level, a PSL bit of 1 is reflected as a High level, 
and a page starts on a Low-to-High transition. If this bit is 1 and PSYNC is an output, 
PSYNC is inverted with respect to the PSL bit. A PSL bit of 0 is reflected as a High level, 
a PSL bit of 1 is reflected as a Low level, and a page starts on a High-to-Low transition. 


Bit 3: Page Sync Level (PSL)—When PSYNC is an input, this bit reflects the level on 
PSYNC. When PSYNC is an output, this bit determines the level on PSYNC. If PSI=0, a 
0 in this bit corresponds to a Low level on PSYNC and a 1 in this bit corresponds toa 
High level on PSYNC. If PSl=1, a 0 in this bit corresponds to a High level on PSYNC and 
a 1 in this bit corresponds to a Low level on PSYNC. 


Bit 2: Line Sync Invert (LSI)—If this bit is 0, a Low-to-High transition of the LSYNC 
input indicates the beginning of a line. If this bit is 1, a High-to-Low transition of the 
LSYNC input indicates the beginning of a line. 


Bit 1: Shift Direction (SDIR)—When this bit is 0, the Video Data Shift Register is shifted 
right to transfer data, with video data being shifted out of the least significant bit of the 
register (corresponding to bit O of the Video Data Holding Register) or into the most 
significant bit (corresponding to bit 31 of the Video Data Holding Register). When this bit 
is 1, the Video Data Shift Register is shifted left to transfer data, with video data being 
shifted out of the most significant bit of the register or into the least significant bit. 
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Bit 0: Video Invert (VIDI)—When this bit is 0, a 1 in the Video Data Shift Register 
corresponds to a High level on VDAT and a 0 in the Video Data Shift Register — 
corresponds to a Low level on VDAT. When this bit is 1, a 1 in the Video Data Shift 
Register corresponds to a Low level on VDAT and a 0 in the Video Data Shift Register 
corresponds to a High level on VDAT. 


Top Margin Register (TOP, Address 800000E4) 
This register (Figure 15-2) specifies the number of lines in the top margin of a page. 


Top Margin Register 
31 23 15 


7 0 


Bits 31-12: Reserved 


Bits 11-0: Top Margin Count (TOPCNT)—This field specifies the number of lines in the 
top margin. 


Side Margin Register (SIDE, Address 800000E8) 


This register (Figure 15-3) specifies the number of data bits in the left margin of a page 
and the number of bits in a raster line of video data. Together, this information sets the 
right and left margins of a page. 


Side Margin Register 
230 CO 15 7; 0) 


31 
LEFTCNT LINECNT 


Bits 31-28: Reserved 


Bits 27-16: Left Margin Count (LEFTCNT)—This field specifies the number of data bit 
equivalents in the left margin of a page. 


Bits 15-0: Line Count (LINECNT)—This field specifies the number of data bits ina 
raster line of video data. 


Video Data Holding Register (VDT, Address 800000EC) 


This register (Figure 15-4) contains data to be transmitted on or received from the video 
interface. Video data is double-buffered so data can be written to or read from the Video 
Data Holding Register while other data is transmitted from or received into the Video 
Data Shift Register. 
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Figure 15-4 Video Data Holding Register 
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31 23 15 7 0 


| VDATA 


Bits 31-0: Video Data (VDATA)—This field is written or read to transmit or receive data 
on the video interface. 


Initialization | 

During a processor reset, the MODE field of the Video Control Register is reset to 00. 
Software must configure the video interface before it is enabled. To prevent possible 
driver conflicts during reset, the PSIO bit is reset and the DDIR bit is set so both PSYNC 
and VDAT are inputs. To allow time for the interface signals to settle, the inputs and 
outputs should be configured before the interface is enabled. 


VIDEO INTERFACE OPERATION 

The operation of the video interface is synchronous to the VCLK input (see Section 
7.1.10), which clocks the video interface either directly or at a frequency multiple 
specified by the CLKDIV field. The CLKDIV field specifies the number of VCLK periods 
in one phase (half period) of the internal video clock. If the CLKDIV field has the value 
0000, the VCLK input is used directly. 


The following equations show how the CLKDIV field determines the internal video clock. 


lf CLKDIV = 0 
Internal Video Clock period = VCLK 


If CLKDIV > 0 

Internal Video Clock period = [VCLK + CLKDIV] * 2 

and | 
Internal Video Clock frequency = [VCLK +(CLKDIV * 2)] 


For example, assuming VCLK = 16 MHz = 62.5 ns period: 


CLKDIV internal Video Clock Period Internal Video Clock Frequency 


0 62.5 ns 16.0 MHz (VCLK) 
on (62.5 ns * 1} *2=125ns 8.0 MHz (1/2 of VCLK) 

2 [62.5 ns * 2} *2 =250ns 4.0 MHz (1/4 of VCLK) 

3 (62.5 ns * 3} *2=375ns 2.67 MHz (1/6 of VCLK) - 
4 (62.5 ns + 4] *2 =500ns 2.0 MHz (1/8 of VCLK) 


The progression continues in this way up to a value of 15 for CLKDIV. 


The clock divider circuit is initialized when the video interface is disabled, and does not 


operate until the interface is enabled by the MODE field. This circuit is also initialized by 
the transition of LSYNC that indicates the beginning of a line. Initializing the clock divider 
with LSYNC insures that the first internal clock period in the line is the indicated number 
of VCLK periods. The maximum frequency of VCLK is the same as the maximum 
frequency of INCLK. The maximum operating frequency of the video interface is the 
frequency of INCLK if the interface is used to output data. The maximum operating 
frequency is one-eighth of the frequency of INCLK if the interface is used to input data. 
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The PSYNC, LSYNC, and VDAT pins are driven and/or sampled during either the 
Low-to-High (CLKI=0) or High-to-Low (CLKI=1) transition of the VCLK input. The clock 
divider sequences on the same transition. If the clock is not divided down, new data can 
be driven or sampled on every active transition of VCLK. If the clock is divided down, 
new data can be driven or sampled on every CLKDIV-times-2 active transition of VOLK. 


Transmitting Data on the Video Interface 


Before the video interface is enabled to transmit, the Video Control Register should be 
set to configure the interface, and the Top Margin and Side Margin registers should be 
set with the appropriate counts. When the DDIR bit is 0 (VDAT is an output) and the 
video interface is disabled or is not transferring data, the VDAT output is held at a level 
corresponding to a 0 data bit (Low if VIDI=0 or High if VIDI=1). Once the video interface 
has been configured, it is enabled via the MODE field. | 


Enabling the video interface with DDIR=0 causes the interface to set the DRQ bit, 
thereby generating an interrupt or DMA request to write the Video Data Holding Register. 
Writing data into the Video Data Holding Register resets the DRQ bit. Data is transferred 
from the Video Data Holding Register to the Video Data Shift Register whenever the 
Video Data Shift Register is empty. After the transfer, the DRQ bit is set to request more 
data. Thus, the DRQ bit may be set very soon after the first data word is written. 
Thereafter, however, the DRQ bit will be set only as data is transmitted on the interface. 


A page cycle begins by an active transition of PSYNC, either as an input or output. At 
the beginning of a page cycle, three count-down registers are loaded from the TOPCNT, 
LEFTCNT, and LINECNT fields. The TOPCNT counter enables the transmission of the 
first raster line when it counts down to zero. The LEFTCNT counter enables the 
transmission of raster data on a line when it counts down to zero. The LINECNT counter 
enables the transmission of raster data as long as it is nonzero. 


After the page cycle begins, the counter registers are not enabled to count until the first 
active transition of LSYNC. An active transition of LSYNC indicates the beginning of a 
line. Because of internal synchronization delay, the video interface does not respond to 
LSYNC until five VCLK cycles have elapsed (see Figure 15-5). If the Video Data Shift 
Register is not empty, an active transition on LSYNC causes the TOPCNT counter to 
decrement by one (the TOPCNT field is unaffected). The TOPCNT counter continues to 
decrement by one on each active transition of LSYNC until it reaches zero. Note that if 
the TOPCNT field contains zero at the beginning of a page, the video interface begins 
transmitting on the first active transition of LSYNC. 


When the TOPCNT counter reaches zero, the interface is enabled to transmit the first 
raster line. At the beginning of the line, the LEFTCNT counter decrements on each 
active transition of the interface clock, beginning five VCLK cycles after the active edge 
of LSYNC, until the counter reaches zero. When the LEFTCNT counter reaches zero, 
the data in the selected end of the Video Data Shift Register is enabled to drive the 
VDAT output, and the LINECNT counter is enabled to count. The LEFTCNT counter is 
reloaded from the LEFTCNT field but does not count until the next active transition of 
LSYNC. If the LEFTCNT field contains zero at the beginning of a line, video data is 
driven and the LINECNT counter is enabled to count immediately on the fifth VCLK cycle 
after the first active transition of LSYNC, after the TOPCNT counter reaches zero. 


The first bit of video data is driven for a period of the interface clock, during the cycle in 
which the LEFTCNT counter reaches zero. On the next active transition of the clock, the 
Video Data Shift Register is shifted right (SDIR=0) or left (SDIR=1) by one bit and the 
new data driven on the VDAT output. Also, the LINECNT counter is decremented by 
one. When the last bit in the Video Data Shift Register has been transmitted, new data is 
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Figure 15-5 VCLK, LSYNC, and VDAT Relationships (CLKI=0, LSI=0 for example only) 
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loaded from the Video Data Holding Register and the DRQ bit is set to request more 
data. Data transmission continues until the LINECNT counter reaches zero. When the 
LINECNT counter reaches zero, the VDAT output is driven to correspond to a 0 data bit 
and the Video Data Shift Register is cleared. The LINECNT counter is reloaded but is 
not enabled to count until a new line begins and the LEFTCNT counter reaches zero 
once more. The VDAT output is held at a 0 data level and the Video Shift Register does 
not shift until the next line is transmitted. Clearing the Video Data Shift Register at the 
end of a line enables it to be reloaded with new data from the Video Data Holding 
Register as soon as this data is available. 


On each subsequent active transition of LSYNC, a subsequent line of data is trans- 
mitted. Each line begins with a synchronization period of five VCLK cycles, then a 
countdown of the LEFTCNT counter until it reaches zero, followed by data transmission | 
and shifting until the LINECNT counter reaches zero. On any active transition of LSYNC, 
if the Video Data Shift Register is empty, the page cycle ends and the video interface 
waits for the next active transition of PSYNC. 


15.3.2 Receiving Data on the Video Interface 


When the video interface is configured to receive data, the TOPCNT and LEFTCNT 
fields are not used, and the PSYNC pin is not used. Data reception is controlled by 
LSYNC, VCLK, and the LINECNT field. 


On the active edge of LSYNC, the LINECNT counter is loaded with the contents of the 
LINECNT field. On the fifth active edge of VCLK following the active edge of LSYNC (for 
synchronization), data is sampled into the selected end of the Video Data Shift Register, 
the register is shifted in the selected direction, and the LINECNT counter is decremented 
by one. When the Video Data Shift Register has received 32 bits, the contents of the 

_ register are transferred into the Video Data Holding Register and the DRQ bit is set to 
request that the data be read. Data sampling and shifting continue until the LINECNT 
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counter reaches zero. To clear the data at the end of a line after the LINECNT counter 
reaches zero, the data in the Video Data Shift Register is transferred into the Video Data 
Holding Register as soon as the holding register is available, and the DRQ bit is set. The 
interface waits for the next active transition of LSYNC before it accepts a new line of data. 
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This chapter describes how to use interrupts and traps to control the behavior of the 
Am29200 and Am29205 microcontrollers. Programmable registers are defined, as are 
vector numbers and interrupt and trap priorities. Interrupt and trap handling is discussed 
along with exception reporting and timer configuration. The chapter concludes with a 
description of the microcontroller’s. internal interrupt handler. | 


OVERVIEW 

The Am29200 and Am29205 microcontrollers employ a lightweight interrupt and trap 
facility that does not automatically save its current state in memory. Saving and restoring 
state information is under software control. Interrupts and traps are dispatched using a 
vector table that can be relocated in memory. 


On the Am29205 microcontroller, external traps and WARN are not supported. Only the 
INTRS-INTR2 inputs are available. The PIO signals can be used as additional interrupts 
when more inputs are required. 





INTERRUPTS AND TRAPS 


Interrupts and traps cause the Am29200 and Am29205 microcontrollers to suspend the — 
execution of an instruction sequence and to begin the execution of a new sequence. The 
processor may or may not later resume the execution of the original instruction sequence. 


The distinction between interrupts and traps ts largely one of causation and enabling. 
Interrupts allow external devices and the timer facility to control processor execution and 
are always asynchronous to program execution. Traps are intended to be used for 
certain exceptional events that occur during instruction execution and are generally 
synchronous to program execution. 


_ Adistinction is made between the point at which an interrupt or trap occurs and the point 


at which it is taken. An interrupt or trap is said to occur when all conditions that define 
the interrupt or trap are met. However, an interrupt or trap that occurs is not necessarily 
recognized by the processor, either because of various enables or because of the 
processor’s operational mode (e.g., Halt mode). An interrupt or trap is taken when the 
processor recognizes the interrupt or trap and alters its behavior accordingly. 


Current Processor Status Register (CPS, Register 2) 


This protected special-purpose register (see Figure 16-1) controls the behavior of the 
processor and its ability to recognize exceptional events. 


Figure 16-1 Current Processor Status Register 
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Bits 31-18: Reserved 


Bits 17: Timer Disable (TD)—When the TD bit is 1, the Timer interrupt is disabled. 
When this bit is 0, the Timer interrupt depends on the value of the IE bit of the Timer 
Reload Register. Note that Timer interrupts may be disabled by the DA bit regardless of 
the value of either TD or IE. The intent of this bit is to provide a means of disabling Timer 
interrupts without having to perform a non-atomic read- -modify-write operation on the 
Timer Reload Register. 


Bit 16—15: Reserved 


Bit 14: Interrupt Pending (IP)—This bit allows software to detect the presence of 
interrupts while the interrupts are disabled. The IP bit is set if an interrupt request is 
active, but the processor is disabled from taking the resulting interrupt due to the value 
of the DA, DI, or IM bits. If all interrupt requests are Subsequently deactivated while still 
disabled, the IP bit is reset. 


Bits 13-12: Trace Enable, Trace Pending (TE, TP)}—The TE and TP bits Faplemeat a 
software-controlled, instruction single-step facility. Single stepping is not implemented 
directly, but rather emulated by trap sequences controlled by these bits. The value of the 
TE bit is copied to the TP bit whenever an instruction completes execution. When the TP 
bit is 1, a Trace trap occurs. Section 17.2 describes the use of these bits in more detail. 


Bit 11: Trap Unaligned Access (TU)—The TU bit enables checking of address 
alignment for external data-memory accesses. When this bit is 1, an Unaligned Access 
trap occurs if the processor either generates an address for an external word not aligned 
on a word address-boundary (i.e., either of the least significant two bits is 1) or gener- 
ates an address for an external half-word not aligned on a half-word address boundary 
(i.e., the least significant address bit is 1). When the TU bit is 0, data-memory address 
alignment is ignored. 


Alignment is ignored for input/output accesses. The alignment of instruction addresses is 
also ignored (unaligned instruction addresses can be generated only by indirect jumps). 
Interrupt/trap vector addresses always are aligned properly by the processor. 


Bit 10: Freeze (FZ)—The FZ bit prevents certain registers from being updated during 
interrupt and trap processing, except by explicit data movement. The affected registers 
are: Channel Address, Channel Data, Channel Control, Program Counter 0, Program 
Counter 1, Program Counter 2, and the ALU Status Register. 


When the FZ bit is 1, these registers hold their values. An affected register can be 
changed only by a Move-To-Special-Register instruction. When the FZ bit is 0, there is 
no effect on these registers and they are updated by processor instruction execution as 
described in this manual. 


The FZ bit is set whenever an interrupt or trap is taken, holding critical state in the 


processor so it is not modified unintentionally by the interrupt or trap handler. © 


If the Freeze (FZ) bit of the Current Processor Status Register is reset from 1 to 0, two 
cycles are required before all program state is reflected properly in the registers affected 
by the FZ bit. This implies that interrupts and traps cannot be enabled until two cycles 
after the FZ bit is reset, for proper sequencing of program state. There is no delay 
associated with setting the FZ bit from 0 to 1. 


Bit 9-8: Reserved 
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Bit 7: Wait Mode (WM)—The WM bit places the processor in the Wait mode. When this 
bit is 1, the processor performs no operations. The Wait mode is reset by an interrupt or 
trap for which the processor is enabled, or by the assertion of the RESET pin. 


Bit 6—5: Reserved 


Bit 4: Supervisor Mode (SM)—The SM bit protects certain processor context, such as 
protected special-purpose registers. When this bit is 1, the processor is in the Supervisor 
mode and access to all processor context is allowed. When this bit is 0, the processor is 
in the User mode and access to protected processor context is not allowed. An attempt 
to access (either read or write) protected processor context causes a Protection 

Violation trap. 


Section 6.1 describes the processor state protected from User-mode access. 


Bits 3-2: interrupt Mask (IM)—The IM field is an encoding of the processor priority with - 
respect to external interrupts. The interpretation of the interrupt mask is specified in 
Section 16.2.2. 


Bit 1: Disable Interrupts (DI)—The DI bit prevents the processor from being interrupted 
by internal peripheral requests and by external interrupt requests INTR3—INTRO on the 
Am29200 microcontroller and INTR3-INTR2 on the Am29205 microcontroller. When this 
bit is 1, the processor ignores all internal and external interrupts. However, traps (both 
internal and external), Timer interrupts, and Trace traps may be taken. When this bit is 0, 
the processor takes any interrupt enabled by the IM field, unless the DA bit is 1. 


Bit 0: Disable All Interrupts and Traps (DA)}—The DA bit prevents the processor from 
taking any interrupts and most traps. When this bit is 1, the processor ignores interrupts 
and traps, except for the WARN trap on the Am29200 microcontroller. When the DA bit 
is O, all traps are taken; interrupts are taken if otherwise enabled. 








Interrupts 

Interrupts are caused by signals applied to any of the external INTRx inputs, by the timer 
facility (see Section 16.8), or by internal peripherals (see Section 16.9). The processor 
may be disabled from taking certain interrupts by the masking capability provided by the 





_ Disable All Interrupts and Traps (DA) bit, Disable Interrupts (Di) bit, and interrupt Mask 


(IM) field in the Current Processor Status Register. Timer interrupts may be disabled by 
the Timer Disable (TD) bit of the Current Processor Status Register. 


The DA bit disables all interrupts. The DI bit disables external interrupts and internal 
peripheral interrupts without affecting the recognition of traps and Timer interrupts. The 
2-bit IM field selectively enables external interrupts as follows: 


IM Value Result — Microcontroller 
00 RO enabled | Am29200 
01 R1—INTRO enabled Am29200 
10 R2-INTRO enabled . Am29200 
THD enabled Am29205 


TR3-INTRO and internal peripheral interrupts enabled Am29200 
TR3-INTR2 and internal peripheral interrupts enabled Am29205 


11 


Note that the INTRO interrupt cannot be disabled by the IM field. Also, no external 


interrupt is taken if either the DA or DI bit is 1. The Interrupt Pending bit in the Current 
Processor Status indicates that one or more interrupt requests is active, but the corre- 
sponding interrupt is disabled due to the value of either DA, DI, or IM. 
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Traps 

Traps are caused by signals applied to one of the Am29200 microcontroller inputs 
TRAP1—-TRAPO or by exceptional conditions such as protection violations. Traps are 
disabled by the DA bit in the Current Processor Status; a 1 in the DA bit disables traps, 
and a O enables traps. It is not possible to selectively disable individual traps. 





External Interrupts and Traps 


An external device causes an interrupt by asserting one of the INTRx inputs, and causes 
a trap by asserting one of the TRAP1—TRAPO inputs on the Am29200 microcontroller. 
Transitions on each of these inputs may be asynchronous to the processor clock; they 
are protected against metastable states. For this reason, an assertion of one of these 
inputs that meets the proper set-up-time criteria does not cause the corresponding 
interrupt or trap until the fourth following cycle. 








The INTRx inputs are prioritized with respect to each other and with respect to the 
processor. To resolve conflicts between these inputs, the inputs are prioritized in order, 
so the interrupt caused by INTRO has the highest priority and the interrupt caused by 
INTR3 has the lowest priority. 





The TRAP1—TRAPO inputs on the Am29200 microcontroller are prioritized with respect 
to each other, so the trap caused by TRAPO has priority over the trap caused by TRAP1 
when a conflict occurs. Both TRAPO and TRAP1 have priority over the INTR3—-INTRO 
inputs. The TRAP1—TRAPO inputs cannot be disabled selectively. Both traps, however, 
can be disabled by the DA bit in the Current Processor Status Register. 




















The INTRx and TRAP1—TRAPO inputs are level-sensitive. Once asserted, they must be 
held active until the corresponding interrupt or trap is acknowledged by the interrupt or 
trap handler. This acknowledgment is system-dependent, since there is no interrupt-ac- 
knowledge mechanism defined for the processor. : 


If any of these inputs is asserted, then deasserted before it is acknowledged, it is not 
possible to predict (unless the interrupt or trap is masked) whether or not the processor 
has taken the corresponding interrupt or trap. During interrupt and trap processing, the 
vector number is determined in part by which of the INTRx and TRAP1—TRAPO inputs is 
active. If the input causing an interrupt or trap is deasserted before the vector number is 
determined, the vector number is unpredictable and the processor operation is also 
unpredictable. Typically, this situation results in the processor taking an Illegal oeere 
trap. 











There is a five-cycle latency from the deassertion of an INTRx or TRAP1-TRAPO input 
to the time the corresponding interrupt or trap is no longer recognized by the processor. 
The latency is due to the metastability hardening that allows these signals to be driven 
with slow-transition-time signals. The deassertion must be timed so the processor is not 
recognizing the interrupt or trap by the time the corresponding mask is reset. Otherwise, 
a spurious interrupt or trap may occur. 


External traps are not supported on the Am29205 microcontroller. Only the 
INTR3—-INTR2 inputs are available for external interrupts. 





‘Wait Mode 


A wait-for-interrupt capability is provided by the Wait mode. The processor is in the Wait 
mode whenever the Wait Mode (WM) bit of the Current Processor Status is 1. While in 
Wait mode, the processor neither fetches nor executes instructions and performs no 
external accesses. The Wait mode is exited when an interrupt or trap is taken. 
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The processor can take only those interrupts or traps for which it is enabled, even in the 
Wait mode. For example, if the processor is in the Wait mode with the DA bit set to 1, it 
can leave the Wait mode only via a processor reset (see Section 2.9.2) or a WARN trap 
(see Section 16.5). 





VECTOR AREA 


Interrupt and trap processing relies on the existence of a user-managed vector area in 
external instruction/data memory. The vector area begins at an address specified by the 
Vector Area Base Address Register and provides for as many as 256 different interrupt 
and trap handling routines. The processor reserves 64 routines for system operation and 
instruction emulation. The number and definition of the remaining 192 possible routines 
are system dependent. | 


The structure of the vector area is a table of vectors in instruction/data memory. The 
layout of a single vector is shown in Figure 16-2. Each vector gives the beginning 
word-address of the associated interrupt or trap handling routine. 


Vector Table Entry 
31 23 15 7 0 


Handler Starting Address RE 
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Vector Area Base Address Register (VAB, Register 0) 


This protected special-purpose register (Figure 16-3) specifies the beginning address of 
the interrupt/trap vector area. The vector area is a table of 256 vectors that point to 
interrupt and trap handling routines. 


When an interrupt or trap is taken, the vector number for the interrupt or trap (See 
Section 16.3.2) replaces bits 9-2 of the value in the Vector Area Base Address Register 
to generate the physical address for a vector contained in instruction/data memory. 


Vector Area Base Address Register 


31 23 15 7 0 


Bits 31-10: Vector Area Base (VAB)—The VAB field gives the beginning physical 
address of the vector area. This address is constrained to begin on a 1-Kbyte address- 
boundary in instruction/data memory. 


Bits 9-0: Zeros—These bits force the alignment of the vector area to a 1-Kbyte 
boundary. 


Vector Numbers 


When an interrupt or trap is taken, the processor determines an 8-bit vector number 
associated with the interrupt or trap. The vector number gives the number of a vector 
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table entry. The physical address of the vector table entry is generated by replacing bits 
9-2 of the value in the Vector Area Base Address Register with the vector number. 


Vector numbers are either predefined or specified by an instruction causing the trap. The 
assignment of vector numbers is shown in Table 16-1 (vector numbers are in decimal 
notation). Vector numbers 64 to 255 are used by trapping instructions; the definition of 
the routines associated with these numbers is system dependent. 


INTERRUPT AND TRAP HANDLING 


Interrupt and trap handling consists of two distinct operations: taking the interrupt or trap 
and returning from the interrupt or trap handler. If the interrupt or trap handler returns 
directly to the interrupted routine, the interrupt or trap handler need not save and restore 
processor state. | 


Old Processor Status Register (OPS, Register 1) 


This protected special-purpose register has the same format as the Current Processor 
Status Register. The Old Processor Status Register stores a .copy of the Current 
Processor Status Register when an interrupt or trap is taken. This is required since the 
Current Processor Status Register is modified to reflect the status of the interrupt/trap 
handler. 


During an interrupt return, the Old Processor Status Register is copied into the Current 
Processor Status Register. This allows the Current Processor Status Register to be set 
as required for the routine that is the target of the interrupt return. 


Program Counter Stack 


The program counter unit, shown in Figure 16-4, forms and sequences instruction 
addresses for the instruction fetch unit. It contains the program counter (PC), the 
program-counter multiplexer (PC MUX), the return address latch, and the program- 
counter buffer. 


The PC forms addresses for sequential instructions executed by the processor. The 
master of the PC Register, PC L1, contains the address of the instruction being fetched 
in the instruction fetch unit. The slave of the PC Register, PC L2, contains the next 
sequential address, which may be fetched by the instruction fetch unit in the next cycle. 


The return address latch passes the address of the instruction following the delayed 
instruction of a call to the register file. This address is the return address of the call. 


The PC buffer stores the addresses of instructions in various stages of execution when 
an interrupt or trap is taken. The registers in this buffer—Program Counters 0, 1, and 2 


_ (PCO, PC1, and PC2)—are normally updated from the PC as instructions flow through 


the processor pipeline. 


When an interrupt or trap is taken, the Freeze (FZ) bit in the Current Processor Status 
Register is set, holding the quantities in the PC buffer. When the FZ bit is set, PCO, PC1, 
and PC2 contain the addresses of the instructions in the decode, execute, and write- 
back stages of the pipeline, respectively. 


Upon the execution of an interrupt return, the target instruction stream is restarted using 
the instruction addresses in PCO and PC1. Two registers are required here because the 
processor implements delayed branches. An interrupt or trap may be taken when the 
processor is executing the delay instruction of a branch and decoding the target of the 


_ branch. This discontinuous instruction sequence must be restarted properly upon an 


interrupt return. Restarting the instruction pipeline using two separate registers correctly 
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Table 16-1 Vector Number Assignments 
| Number Type of Trap or Interrupt Cause 
0 Illegal Opcode Executing undefined instruction! 
1 Unaligned Access Access on unnatural boundary, TU = 1 
2 Out-of-Range ‘ Overflow or underflow 
3-4 Reserved 
5 Protection Violation Invalid User-mode operation? 
6-7 Reserved | 
8 User Instruction Mapping Miss No DRAM mapping for access 
9 User Data Mapping Miss No DRAM mapping for access 
10 Supervisor Instruction Mapping Miss No DRAM mapping for access 
11 Supervisor Data Mapping Miss No DRAM mapping for access 
12-13 Reserved 
14 Timer Timer Facility 
15 Trace eS Facility 
16 INTRO4 INTI itis input 
17 INTR14 R1 input 
18 INTR2 1 input 
19 INTR3/Internal INT R3 input or internal peripheral 
20 TRAPO*4 APO input 
21 TRAP 14 TRAP input 
22 Floating-Point Exception ‘Unmasked floating-point exception? 
23 Reserved 
24-29 Reserved for instruction emulation 
(opcodes D8-DD) 
30 MULTM MULTM instruction 
31 MULTMU MULTMU instruction 
32 MULTIPLY MULTIPLY instruction 
33 DIVIDE DIVIDE instruction 
34 MULTIPLU MULTIPLU instruction 
35 DIVIDU DIVIDU instruction 
—686 CONVERT _ CONVERT instruction 
37 SQRT SQRT instruction 
38 CLASS CLASS instruction 
39-41 Reserved for instruction emulation 
(opcode lee) 
42 FEQ FEQ instruction 
43 DEQ DEQ instruction 
44 FGT FGT instruction 
45 DGT DGT instruction 
46 FGE FGE instruction 
47 DGE DGE instruction 
48 FADD FADD instruction 
49 DADD DADD instruction 
50 FSUB FSUB instruction 
51 DSUB DSUB instruction 
ney FMUL FMUL instruction 
Notes: 


1. 


2. 
3. 


This vector number also results if an external device removesINTRXx or TRAP 1-TRAPO before the corresponding interrupt or 
trap is taken by the processor (see Section 16.2.4). 


Some Supervisor-mode operations cause Protection Violations to facilitate virtualization of certain operations. 


The Floating-Point Exception trap is not generated by the processor hardware. It is generated by the software that implements 
the virtual arithmetic interface (see Section 2.8). 


Cannot be generated by the Am29205 microcontroller hardware. 
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Table 16-1 Vector Number Assignments (continued) 





Number Type of Trap or Interrupt Cause 
53 DMUL DMUL instruction 
54 FDIV | FDIV instruction 
55 DDIV DDIV instruction 
56 Reserved for instruction emulation 
(opcode F8) 
57 FDMUL lt FDMUL instruction 


58-63 Reserved for instruction emulation | 
(opcode FA-FF) | 


64-255 ASSERT and EMULATE instruction traps Note® 
| (vector number specified by instruction) 


Notes: (continued) | 

5. Some of Vector Numbers 64-255 are reserved for software compatibility (see Sections 4.2.3 and 4.2.6). These are docu- 
mented in Chapter 4 and in the Host Interface (HIF) Specification (included in the RISC Design-Made-Easy Application Guide, 
PID #16693A), available from AMD. | 
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handles this special case; in this case, PC1 points to the delay instruction of the branch, 
and PCO points to its target. PC2 does not participate in the interrupt return, but is 
included to report the addresses of instructions causing certain exceptions. _ 


The PC is not defined as a special-purpose register. It cannot be modified or inspected 
by instructions. Instead, the interrupting and restarting of the pipeline is done by the PC 
Buffer registers PCO and PC1. 


Program Counter 0 Register (PCO, Register 10) 


This protected special-purpose register (Figure 16-5) is used on an interrupt return to 
restart the instruction in the decode stage when the original interrupt or trap was taken. 


Program Counter O Register | 
31 23 15 7 0 


PCO | o 


Bits 31-2: Program Counter 0 (PC0)—This field captures the word-address of an 
instruction as it enters the decode stage of the processor pipeline, unless the Freeze 
(FZ) bit of the Current meses Status Register is 1. If the FZ bit is 1, PCO holds its 
value. | 


When an interrupt or trap is taken, the PCO field contains the word-address of the 
instruction in the decode stage. The interrupt or trap has prevented this instruction from 
executing. The processor uses the PCO field to restart this instruction on an interrupt 
return. 


Bits 1-0: Zeros—tThese bits are zero since instruction addresses are eae word 
aligned. 


Program Counter 1 Register (PC1, Register 11) 


This protected special-purpose register (Figure 16-6) is used on an interrupt return to 
restart the instruction in the execute stage when the original interrupt or trap was taken. 


Program Counter 1 Register 
31 23 15 7 0 


Bits 31-2: Program Counter 1 (PC1)—This field captures the word-address of an 


instruction as it enters the execute stage of the processor pipeline, unless the Freeze 
(FZ) bit of the Current Processor Status Register is 1. If the FZ bit is 1, PC1 holds its 
value. 


When an interrupt or trap is taken, the PC1 field contains the word-address of the 
instruction in the execute stage; the interrupt or trap has prevented this instruction from 
completing execution. The processor uses the PC1 re to restart this instruction on an 
interrupt return. 


Bits 1-0: Zeros—tThese bits are zero, since instruction addresses are always word 
aligned. 
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- Program Counter 2 Register (PC2, Register 12) 


This protected special-purpose register (Figure 16-7) reports the address of certain 
instructions causing traps. 


Program Counter 2 Register | 
31 23 15 7 0 


PC2 3 ff 


Bits 31-2: Program Counter 2 (PC2)—This field captures the word address of an 
instruction as it enters the write-back stage of the processor pipeline, unless the Freeze 
(FZ) bit of the Current Processor Status Register is 1. If the FZ bit is 1, PC2 holds its 
value. 


When an interrupt or trap is taken, the PC2 field contains the word address of the 
instruction in the write-back stage. In certain cases PC2 contains the address of the 
instruction causing a trap. The PC2 field is used to report the address of this instruction 
and has no other use in the processor. 


Bits 1-0: Zeros—These bits are zero since instruction addresses are always word 
aligned. 


Taking an Interrupt or Trap 


The following operations are performed in sequence by the processor when an interrupt 
or trap is taken: 


1. Instruction execution is suspended. 
2. Instruction fetching is suspended. 


3. Any in-progress load or store operation is completed. Any additional operations are 
canceled in the case of load multiple and store multiple. 


4. The contents of the Current Processor Status Register are copied into the Old Pro- 
cessor Status Register. | 


5. The Current Processor Status register is modified as shown in Figure 16-8 (the value 
- umeans unaffected). Note that setting the Freeze (FZ) bit freezes the Channel Ad- 
dress, Channel Data, Channel Control, Program Counter 0, Program Counter 1, Pro- 
gram Counter 2, and ALU Status Registers. 


6. The address of the first instruction of the interrupt or trap handler is determined. The 
address is obtained by accessing a vector from instruction/data memory, using the 
physical address obtained from the Vector Area Base Address Register and the vec- 
tor number. This is a 32-bit access. 


7. An instruction fetch is initiated using the instruction address determined in step 6. At 
this point, normal instruction execution resumes. 


Note that the processor does not explicitly save the contents of any registers when an 


interrupt is taken. If register saving is required, it is the responsibility of the interrupt- or 
trap-handling routine. For proper operation, registers must be saved before any further 
interrupts or traps may be taken. The FZ bit must be reset at least two instructions 
before interrupts or traps are re-enabled, to allow program state to be reflected properly 
in processor registers if an interrupt or trap is taken. 
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Figure 16-8 Current Processor Status After an Interrupt or Trap 
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Figure 16-9 
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Returning from an Interrupt or Trap 


Two instructions are used to resume the execution of an interrupted program: Interrupt 
Return (IRET), and Interrupt Return and Invalidate (IRETINV). These instructions are 
identical in the Am29200 and Am29205 microcontrollers; in other 29K Family proces- 
sors, the IRETINV instruction resets all Valid bits in an instruction cache, whereas the 
IRET instruction does not affect the Valid bits. 


In some situations, the processor state must be set properly by software before the 
interrupt return is executed. The following is a list of operations normally performed in 
such cases: ; 


1. 


The Current Processor Status Register is configured as shown in Figure 16-9 (the 
value xis a don’t care). Note that setting the FZ bit freezes the registers listed below 
so they may be set for the interrupt return. 


. The Old Processor Status Register is set to the value of the Current Processor Status 


for the target routine. 


. The Channel Address, Channel Data, and Channel Control registers are set to restart 


or resume uncompleted external accesses of the target routine. 


. The Program Counter 1 and Program Counter 0 registers are set to the addresses of 


the first and second instructions, respectively, to be executed in the target routine. 


. Other registers are set as required. These may include registers such as the ALU 


Status, Q, and so forth, depending on the particular situation. Some of these registers 
are unaffected by the FZ bit so they must be set in such a manner that they are not 
modified unintentionally before the interrupt return. | 


Current Processor Status Before Interrupt Return 
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Once the processor registers are configured properly, as described above, an interrupt 
return instruction (IRET or IRETINV) performs the remaining steps necessary to return 
to the target routine. The following operations are performed by the interrupt return 
instruction: 


1. 


Any in-progress load or store operation is completed. If a load-multiple or store-multi- 
ple sequence is in progress, the interrupt return is not executed until the sequence 
completes. 


. Interrupts and traps are disabled, regardless of the settings of the DA, DI, and IM 


fields of the Current Processor Status, for steps 3 through 10. 


. The contents of the Old Processor Status Register are copied into the Current Pro- 


cessor Status Register. This normally resets the FZ bit, allowing the Program Counter 
0, 1, 2, Channel Address, Data, Control, and ALU Status registers to update normally. 
Since certain bits of the Current Processor Status Register always are updated by the 


_ processor, this copy operation may be irrelevant for certain bits (e.g., the Interrupt 


Pending bit). 


. If the Contents Valid (CV) bit of the Channel Control Register is 1, and the Not Needed 


(NN) and Multiple Operation (ML) bits are both 0, an external access is started. This 
operation is based on the contents of the Channel Address, Channel Data, and Chan- 
nel Control registers. The Current Processor Status Register conditions the access as 
usual. Load-multiple and store-multiple operations are not restarted at this point. 


. The address in Program Counter 1 is used to fetch an instruction. The Current Pro- 


cessor Status Register conditions the fetch. This step is treated as a branch in the 
processor pipeline. | 


. The instruction fetched in step 5 enters the decode stage of the pipeline. 


7. The address in Program Counter 0 is used to fetch an instruction. The Current Pro- 


cessor Status Register conditions the fetch. This step is treated as a branch in the 
processor pipeline. 


. The instruction fetched in step 5 enters the execute stage of the pipeline, and the 


instruction fetched in step 7 enters the decode stage. 


. If the CV bit in the Channel Control Register is a 1, the NN bit is 0, and the ML bit is 


1, a load-multiple or store-multiple sequence is started based on the contents of the 
Channel Address, Channel Data, and Channel Control registers. 


10. Interrupts and traps are enabled per the appropriate bits in the Current Processor 


Status Register. 


11. The processor resumes normal operation. 


Lightweight Interrupt Processing 


The registers affected by the FZ bit of the Current Processor Status Register are those 
modified by almost any usual sequence of instructions. Since the FZ bit is set by an 
interrupt or trap, the interrupt or trap handler is able to execute while not disturbing the 
state of the interrupted routine, though its execution is somewhat restricted. Thus, it is. 
not necessary in many cases for the interrupt or trap handler to save the registers 
affected by the FZ bit. This permits the implementation of lightweight interrupt handlers 
that do not have all of the overhead normally associated with interrupt handlers. 


The processor provides an additional benefit to lightweight interrupts if the Program 
Counter 0 and Program Counter 1 Registers are not modified by the interrupt or trap 
handler. lf Program Counters 0 and 1 contain the addresses of sequential instructions 
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when an interrupt or trap is taken, and if they are not modified before an interrupt return 
is executed, step 7 of the interrupt return sequence in Section 16.4.4 occurs as a 
sequential fetch—instead of a branch—for the interrupt return. The performance impact 
of a sequential fetch is normally less than that of a branch. 


Because the registers affected by the FZ bit are sometimes required for instruction 
execution, it is not possible for the lightweight interrupt or trap handler to execute all 
instructions, unless the required registers are first saved elsewhere (e.g., in one or more 
global registers). Most of the restrictions due to register dependencies are obvious (e.g., 
the Byte Pointer for byte extracts) and will not be discussed here. Other less obvious 
restrictions are listed below: 


m Load Multiple and Store Multiple. The Channel Address, Channel Data, and Channel 
Control registers are used to sequence load-multiple and store-multiple operations, 
so these instructions cannot be executed while the registers are frozen. However, 
other external accesses may cccur; the Channel Address, Channel Data, and Chan- 
nel Control registers are required only to restart an access after an exception, and the 
interrupt or trap handler is not expected to encounter any exceptions. 


m= Loads and stores that set the Byte Pointer. If the SB bit of a load or store instruction is 
1 and the FZ bit is also 1, there is no effect on the Byte Pointer. Thus, the execution 
of external byte and half-word accesses using this mechanism is not possible. 


@ Extended arithmetic. The eceny bit of the ALU Status Register is not updated while 
the FZ bit is 1. 


m= Divide step instructions. The Divide Flag of the ALU Status Register is not updated 
when the FZ bit is 1. 


If the interrupt or trap handler does not save the state of the interrupted routine, it cannot 
allow additional interrupts and traps. Also, the operation of the interrupt or trap handler 
cannot depend on any trapping instructions (é.g., floating-point instructions, assert 
instructions, illegal operation codes, arithmetic overflow, etc.), since these are disabled. 
There are certain cases, however, where traps are unavoidable. Special considerations | 
for these cases are discussed in Section 16.7.6. 


Simulation of Interrupts and Traps 


Assert instructions may be used by a Supervisor-mode program to simulate the occur- 
rence of various interrupts and traps defined for the processor. Only an assert instruction 
executed in Supervisor mode can specify a vector number between 0 and 63. If this 
instruction causes a trap, the effect is to create an interrupt or trap similar to that 
associated with the specified vector number. 


a 


Thus, the interrupt and trap routines defined for basic processor operation can be 
invoked without creating any particular hardware condition. For example, on the 
Am29200 microcontroller, an INTR1 interrupt may be simulated by an assert instruction 
that specifies a vector number of 17, without the activation of the INTR1 signal. 





‘WARN TRAP (Am29200 Microcontroller) 





The processor recognizes a special trap, caused by the activation of the WARN input, 
that cannot be masked. The WARN trap is intended to be used for severe system-error 
or deadlock conditions. It allows the processor to be placed in a known, operable state, 
while preserving much of its original state for error reporting and possible recovery. 
Therefore, it shares some features in common with the Reset mode as well as features 





_ common to other traps described in this section. 
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The major differences between the WARN trap and other traps are: 


m™ The processor does not wait for an in-progress external access to complete before tak- 
ing the trap, since this access might not complete (for example, because WAIT is as- 
serted). However, the information related to any outstanding access is retained by the 
Channel Address, Channel Data, and Channel Control registers when the trap is taken. 


m The vector-fetch operation is not performed when the WARN trap is taken. Instead, 
instruction fetching begins immediately at address 16. 


Note that the WARN trap may disrupt the state of the routine that is executing when it is . 
taken, prohibiting this routine from being restarted. WARN may also reset some internal 
peripherals. 








WARN Input (Am29200 Microcontroller) 


An inactive-to-active transition on the WARN input causes a WARN trap to be taken by 
the processor. The WARN trap cannot be disabled; the processor responds to the 
WARN input regardless of its internal condition unless the RESET input is also asserted. 
The WARN input is provided so the system can gain control of the processor in extreme 
situations, such as when system power is about to be removed or when a severe 
non-recoverable error occurs. | | 











The WARN input is edge-sensitive so an active level on the WARN input for long 
intervals does not cause the processor to take multiple WARN traps. However, WARN 
must be held active for at least four cycles in order to be properly recognized by the 
processor. The processor still takes the WARN trap if WARN is deasserted after four 
cycles. Another WARN trap occurs if WARN makes another inactive-to-active transition. 

















The processor enters the Executing mode when the WARN input is asserted, regardless 
of its previous operational mode. Either seven or eight cycles after WARN is asserted 
(depending on internal synchronization time), the processor performs a trap-handler 
instruction access on the bus. This access is directed to address 16. 


SEQUENCING OF INTERRUPTS AND TRAPS 
On every cycle, the processor decides either to execute instructions or to take an 


_interrupt or trap. Since there are multiple sources of interrupts and traps, more than one 
interrupt or trap may be pending on a given cycle. | 


To resolve conflicts, interrupts and traps are taken according to the priority shown in 
Table 16-2. In this table, interrupts and traps are listed in order of decreasing priority. 
This section discusses the first three columns of Table 16-2. The last two columns are 
discussed in Section 16.7. 


In Table 16-2, interrupts and traps fall into one of two categories depending on the timing 
of their occurrence relative to instruction execution. These categories are indicated in 
the third column of Table 16-2 by the labels /nst and Async. These labels have the 
following meaning: . | | 


 =Inst—Generated by the execution or attempted execution of an instruction. 


m Async—Generated asynchronous to and independent of the instruction being 
executed, although it may be a result of an instruction executed previously. 
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Table 16-2 Interrupt and Trap Priority Table 


Priority 
1 
(Highest) 


2 


10 


11 


12 


13 
(Lowest) 


Notes: 


Type of Interrupt or Trap 
WARN2 


User-Mode Data Mapping Miss 
Supervisor-Mode Data Mapping Miss 


Unaligned Access 

Out-of-Range 

Assert Instructions 

Floating-Point Instructions 

Integer Multiply/Divide Instructions 
EMULATE 


INTR3 
Internal peripheral interrupts 


Timer 
Trace 


User-mode Inst Mapping Miss 
Supervisor-mode Inst Mapping Miss 


Illegal Opcode 
Protection Violation 


Inst/Async 
Async 


Inst . 
_ Inst 


inst 
Inst 
inst 
Inst 
Inst 
Inst 


Async 
Async 
Async 
Async 
Async 


Async 
Async 


Async 
Async 


Inst 
Inst 


inst 
Inst 


PC1 
Next 


Next 
Next 


Next 
Next 
Next 
Next 
Next 
Next 


Next 
Next . 


| Next 


Next 


Next 


Next 


Next 


Next 


Next 


Curr 
Curr 


Curr 
Curr 
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Channel Regs 


Note’ 


All 
All 


All 

N/A 
N/A 
N/A 
N/A 
N/A 


Multiple 
Multiple 
Multiple 
Multiple 
Multiple 


Multiple 
Multiple 


Multiple 
Multiple 


N/A 
N/A 


N/A 
N/A 


1. The Channel Address, Channel Data, and Channel Control registers are set foraWARN trap onthe Am29200 microcontroller | 
only if an external access is in progress when the trap is taken. 


2. Not supported on the Am29205 microcontroller. 
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The principle for interrupt and trap sequencing is that the highest priority interrupt or trap 
is taken first. Other interrupts and traps either remain active until they can be taken or 
they are regenerated when they can be taken. This is accomplished depending on the 
type of interrupt or trap, as follows: 


1. All traps in Table 16-2 with priority 13 myougi 15 are regenerated by the re-execution 
of the causing instruction. 


2. Most of the interrupts and traps of priority 4 through 12 must be held by external hard- 
, ware until they are taken. The exceptions to this are listed in item 3. 


3. The exceptions to item 2 are the Timer interrupt and the Trace trap. These are 
caused by bits in various registers in the processor and are held by these registers 
until taken or cleared. The two relevant bits are the Interrupt (IN) bit of the Timer Re- 
load Register for Timer interrupts and the Trace Pending (TP) bit of the Current Pro- 
cessor Status Register for Trace traps. | 


4. All traps of priority 2 and 3 in Table 16-2, except for the Unaligned Access trap, are not 
regenerated. These traps are mutually exclusive and are given high priority because they 
cannot be regenerated; they must be taken if they occur. If one of these traps occurs at 
the same time as a reset or WARN trap, it is not taken and its occurrence is lost. 





5. The Unaligned Access trap is regenerated internally when an external access is re- 
started by the Channel Address, Channel Data, and Channel Control registers. Note 
this trap is not necessarily exclusive to the traps discussed in item 4 above. 


The Channel Address, Channel Data, and Channel Control registers are set fora WARN 
trap only if an external access is in:progress when the trap is taken. 


EXCEPTION REPORTING AND RESTARTING 


When an instruction encounters an exceptional condition, the Program Counter 0, 
Program Counter 1, and Program Counter 2 registers report the relevant instruction 
address(es) and allow the instruction sequence to be restarted once the exceptional 
condition has been remedied (if possible). Similarly, when an external access encoun- 
ters an exceptional condition, the Channel Address, Channel! Data, and Channel Control 
registers report information on the access or transfer and allow it to be restarted. This 
section describes the interpretation and use of these registers. 


The PC? column in Table 16-2 describes the Value held in the Program Counter 1 


- Register (PC1) when the interrupt or trap is taken. For traps in the Inst category, PC1 


contains either the address of the instruction causing the trap, indicated by Curr, or the 
address of the instruction following the instruction causing the trap, indicated by Next. 


For interrupts and traps in the Async category, PC1 contains the address of the first 
instruction not executed due to the taking of the interrupt or trap. This is the next 
instruction to be executed upon interrupt return, as indicated by Next in the PC1 column. 


| Instruction Exceptions 


For traps caused by the execution of an instruction (e.g., the Out-of-Range trap), the 
Program Counter 2 Register contains the address of the instruction causing the trap. In 
all of these cases, PC1 is in the Next category. 


The traps associated with instruction fetches (i.e., those of priority 13) occur only if the 
processor attempts the execution of the associated instruction. An exception may be 
detected during an instruction prefetch, but the associated trap does not occur if the 
processor branches before it attempts to execute the invalid instruction. This prevents 
spurious instruction exceptions. 
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Restarting Mapped DRAM Accesses 


DRAM mapping is provided to support application needs such as on-the-fly data 
compression and decompression. In such applications, programs operate on large, 
compressed data structures by decompressing data into a smaller region of memory, 
operating on the data, and then compressing back into the large compressed structure. 
The ability to store the data in a compressed format reduces system memory require- 
ments, while the ability to operate on the data in a decompressed format simplifies the 
application software. 


For generality, mapped DRAM accesses allow the mapping configuration to be changed 
on demand. In other words, the DRAM mapping Is performed by a system routine that 
changes the mapping as needed by the application program. This allows applications 
written with no knowledge of DRAM mapping to operate in a system that uses DRAM 
mapping. Since the DRAM mapping trap is part of normal system operation and does 
not represent an error, the access that causes the trap must be restarted—once the 
trapping condition is remedied—in a manner that cannot be detected by the program 
causing the trap. 


The Am29200 and Am29205 microcontrollers overlap external accesses with the 
execution of instructions. Thus, traps caused by accesses are imprecise. The address of 
the instruction that initiated the access cannot be determined by the trap handler. Since 
the address of the initiating instruction is unknown, the access cannot be restarted by 
re-executing this instruction. Even if the address could be determined, the instruction 
might not be restartable since an instruction executed before the trap occurred, but after 


_ the access began, may have altered the conditions of the access, such as by altering © 


the address source register. 


In order to provide for the restarting of loads and stores that cause exceptions, the 
processor saves all information required to restart these accesses in the Channel 


_ Address, Channel Data, and Channel Control registers. The Contents Valid (CV) and 


Not Needed (NN) bits in the Channel Control Register indicate that the information 
contained in these registers represents an access that must be restarted. The CV bit 
indicates the access did not complete, and the NN bit indicates whether or not the data 
from the access is required by the processor. | 


Note that since instruction execution is overlapped with external accesses, an instruction 
that executes after a load may alter the destination register for the load. If a trap occurs 
in this situation, the access information in the Channel Address, Channel Data, and 
Channel Control registers is correct, but the load cannot be restarted because it will » 
destroy the new value in the destination register. The NN bit provides correct operation 
in this case. 


When an interrupt or trap is taken, the handling routine has access to the Channel 
Address, Channel Data, and Channel Control registers. The contents of these registers 
may contain information relevant to an incomplete access and can be preserved for 
restarting this access. Since these registers are frozen (due to the FZ bit of the Current 
Processor Status) they are not available to monitor any external accesses in the 
interrupt or trap handler until their contents are saved and the FZ bit is reset. 


Upon an interrupt return (IRET or IRETINV), the processor restarts an access using the 


‘Channel Address, Channel Data, and Channel Control registers. The access is initiated 


if the CV bit of the Channel Control Register is 1 and the NN bit is 0. The restart cannot 
be detected in the logical operation of the restarted routine, although the timing of 


- execution is altered. 
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The mechanism used to restart trapping accesses has the additional benefit of allowing 
a fast interrupt-response time when the processor is performing a load-multiple or 
store-multiple operation. An interrupted load-multiple or store-multiple is restarted as if it 
had faulted. In this case, the operation resumes from the point of interruption, not from 
the beginning of the sequence. 


16.7.2.1 Channel Address Register (CHA, Register 4) 


This protected special-purpose register (Figure 16-10) is used to report exceptions 
during external accesses. It is also used to restart interrupted load-multiple and store- 
multiple operations and to restart other external accesses when possible (e.g., after 

DRAM mapping misses are serviced). 


The Channel Address Register is updated on the execution of every load or store 
instruction and on every load or store in a load-multiple or store-multiple sequence, 
~ except when the Freeze (FZ) bit in the Current Processor Status Register is 1. 


Figure 16-10 Channel Address Register | 
31 23 15 7 0 


Bits 31-0: Channel Address (CHA)—This field contains the address of the current 
access (if the FZ bit of the Current Processor Status Register is 0). 


16.7.2.2 Channel Data Register (CHD, Register 5) 


This protected special-purpose register (Figure 16-11) is used to report exceptions 

_ during external accesses. It is also used to restart the first store of an interrupted 
store-multiple operation and to restart other external accesses when possible (e.g., after 
DRAM mapping misses are serviced). 


The Channel Data Register is updated on the execution of every load or store instruction 
and on every load or store in a load-multiple or store-multiple sequence, except when 
the Freeze (FZ) bit in the Current Processor Status Register is 1. When the Channel 
Data Register is updated for a load operation, the resulting value is unpredictable. 


Figure 16-11 Channel Data Register 


© 


31 23 15 | 7 
CHD 


Bits 31-0: Channel Data (CHD)}—This field contains the data (if any) associated with 
the current access (if the FZ bit of the Current Processor Status negister is 0). If the 
current access is not a Store, the value of this field is irrelevant. 


16.7.2.3 Channel Control Register (CHC, Register 6) 


This protected special-purpose register (Figure 16-12) is used to report exceptions 
during external accesses. It is also used to restart interrupted load-multiple and store- 
multiple operations and to restart other external accesses when possible (e.g., after 
DRAM mapping misses are serviced). 
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The Channel Control Register is updated on the execution of every load or store 
instruction and on every load or store in a load-multiple or store-multiple sequence, 
except when the Freeze (FZ) bit in the Current Processor Status Register is 1. 


Figure 16-12 Channel Control Register 


16.7.3 





Bits 31—24:—These bits are a direct copy of bits 23-16 from the load or store instruction 
that started the current access (see Section 3.3). 


_ Bits 23-16: Load/Store Count Remaining (CR)}—The CR field indicates the remaining 


number of transfers for a load-multiple or store-multiple operation that encountered an 
exception or was interrupted before completion. This number is zero-based; for 
example, a value of 28 in this field indicates that 29 transfers remain to be completed. 


Bit 15: Load/Store (LS)—The LS bit is 0 if the access is a store operation and is 1 if the 
access is a load operation. 


Bit 14: Multiple Operation (ML)—The ML bit is 1 if the current access is a partially- 
complete load-multiple or store-multiple operation; otherwise it is 0. 


Bit 13: Set (ST)—The ST bit is 1 if the current access is for a Load and Set instruction; 
otherwise it is 0. 


Bit 12—10: Reserved 


Bits 9-2: Target Register (TR)—The TR field indicates the absolute register number of 
the data operand for the current access (either a load target or store data source). Since 
the register number in this field is absolute, it reflects the Stack-Pointer addition when 
the indicated register is a local register. | 


Bit 1: Not Needed (NN)—The NN bit indicates that even though the Channel Address, 
Channel Data, and Channel Control registers contain a valid representation of an 
incomplete load operation, the data requested is not needed. This situation arises when 
a load instruction is overlapped with an instruction that writes the load target register. 


Bit 0: Contents Valid (CV)}—The CV bit indicates the contents of the Channel Address, 
Channel Data, and Channel Control registers are valid. 


integer Exceptions | 

Some integer add and subtract instructions—ADDS, ADDU, ADDCS, ADDCU, SUBS, 
SUBU, SUBCS, SUBCU, SUBRS, SUBRU, SUBRCS, and SUBRCU—cause an 
Out-of-Range trap upon overflow or underflow of a 32-bit signed or unsigned result, 
depending on the instruction. | 


Two integer multiply instructions—MULTIPLY and MULTIPLU—cause an Out-of-Range 
trap upon overflow of a 32-bit signed or unsigned result, respectively, if the MO bit of the 
Integer Environment Register is 0. If the MO bit is 1, these multiply instructions cannot 
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cause an Out-of-Range trap. Since the processor does not contain hardware to directly 
support these instructions, the Out-of-Range trap must be generated by the software 
that implements the virtual arithmetic interface (see Section 2.8). 


Two integer divide instructions—DIVIDE and DIVIDU—take the Out-of-Range trap upon 
overflow of a 32-bit signed or unsigned result, respectively, if the DO bit of the Integer 
Environment Register is 0. If the DO bit is 1, the divide instructions cannot cause an 
Out-of-Range trap unless the divisor is zero. If the divisor is zero, an Out-of-Range trap 
always occurs, regardless of the DO bit. 


For the MULTIPLY, MULTIPLU, DIVIDE, and DIVIDU instructions, the destination 
register (or registers) is unchanged if an Out-of-Range trap is taken. 


Floating-Point Exceptions 


A Floating-Point Exception trap occurs when an exception is detected during a floating- 
point operation and the exception is not masked by the corresponding bit of the Floating- 
Point Mask Register. In this context, a floating-point operation is defined as any 
operation that accepts a floating-point number as a source operand, that produces a 
floating-point result, or both. Thus, for example, the CONVERT instruction may create 
an exception while attempting to convert a Hoang point value to an integer value or vice 
versa. 


In addition to the operations described in Section 16.4.3, the following operations are 
performed when a Floating-Point Exception trap is taken: 


1. The status of the trapping operation is written into the trap status bits of the Floating- 
Point Status Register. The written status bits do not depend on the values of the cor- 
responding mask bits in the Floating-Point Environment Register. 


2. The destination register or registers are left unchanged. 


Correcting Out-of-Range Results 


Some Arithmetic instructions cause an Out-of-Range trap if the arithmetic operation 
causes an overflow or underflow. When an Out-of-Range trap occurs, the result of the 
operation, though incorrect, is written into the destination register. Furthermore, the 
Program Counter 2 Register contains the address of the trapping instruction, and the 
ALU Status Register contains an indication of the cause of the trap. It is possible, if 
required, for the trap handler to use this information to form the correct result. 


The ALU Status indicates the cause of the Out- of- -Range trap based on the operation 
performed, as follows: 


1. Signed overflow. If the Out-of-Range trap is caused by signed, two’s-complement 
overflow (this can occur for both signed adds and subtracts), the V bit is 1. 


2. Unsigned overtlow. If the Out-of-Range trap is caused by unsigned overflow (this can 
occur only for unsigned adds), the C bit is 1. 


3. Unsigned underflow. If the Out-of- -Range trap is caused by unsigned underflow (this 
can occur only for unsigned subtracts), the C bit is 0. 


The multiply instructions, MULTIPLY and MULTIPLU, can cause an Out-of-Range trap if 
the MO bit of the Integer Environment Register is 0 and the operation overflows. 
However, these instructions do not set the ALU Status Register. This exception is 
detected by reading the trapping instruction whose address is in the PC2 Register. 
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Exceptions During Interrupt and Trap Handling 


In most cases, interrupt and trap handling routines are executed with the DA bit in the 
Current Processor Status having a value of 1. It is normally assumed these routines do 
not create many of the exceptions possible in most other processor routines. 


lf these assumptions are not valid for a particular interrupt or trap handler, the handler 
must save the state of the processor and reset the FZ bit of the Current Processor 
Status so the handler itself may be restarted properly. This must be accomplished before 
any interrupts or traps can be taken. In this case, the state (or the state of some other 
process) must be restored before an interrupt return is executed. 


TIMER FACILITY 
The processor has a built-in timer facility that can be configured to cause periodic 


‘ interrupts. The timer facility consists of two special-purpose registers—the timer counter 


and the timer reload registers—accessible only to Supervisor-mode programs. Also, the 
Current Processor Status Register contains a control bit as part of the timer facility. 
These registers implement timing functions independent of program execution. 


Timer Facility Operation 

The Timer Counter Register has a 24-bit Timer Count Value (TCV) field that decrements 
by one on every processor cycle. If the TCV field decrements to zero, it is written with 
the Timer Reload Value (TRV) field of the Timer Reload Register on the next cycle; the 
Interrupt (IN) bit of the Timer Reload register is set at the same time. Reloading the TCV 
field by the TRV field maintains the accuracy of the timer facility. 


The Timer Reload Register contains the 24-bit TRV field and the control bits Overflow 
(OV), Interrupt (IN), and Interrupt Enable (IE). If the IN bit is 1 and the IE bit also 1,a 
Timer interrupt occurs. If the IN bit is 1 when the TCV field decrements to zero, the OV 
bit is also set. The OV bit indicates a Timer interrupt may have occurred before a 
previous interrupt was serviced. | 


The Current Processor Status Register contains the Timer Disable (TD) control bit. If the 
TD bit is 1, Timer interrupts are disabled. The TD bit and the IE bit have equivalent 
functions; the TD bit is provided so the timer may be disabled without having to perform 
a non-atomic read-modify-write operation on the Timer Reload Register. There is a 
possibility the TCV might decrement to zero and set the IN bit as the modified value is 
written back to the Timer Reload Register, causing a Timer interrupt to be missed. 


Timer Facility Initialization 

To initialize the timer facility, the following steps should be taken in the specified order (it 
is assumed that Timer interrupts are disabled by the DA bit of the Current Processor 
Status Register or the TD bit of the Current Processor Status Register during the 
following steps): 


1. Set the TCV field with the desired interval count for the first timing interval. This inter- 
val must be sufficiently large to allow the execution of the next step before the TCV 
field decrements to zero (this normally is the case). 


2. Set the TRV field with the desired interval count for the second timing interval. The 
OV and IN bits are reset and the IE bit is set as desired. The second timing interval 
may be equivalent to the first timing interval. 
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Handling Timer Interrupts 
The following is a suggested list of actions necessary to handle a Timer interrupt: 


1. Read the Timer Reload Register into a general-purpose register. 
2. Reset the IN bit in the general-purpose register. 


3. Set the TRV field in the general-purpose register to the desired value for the next tim- 
ing interval. Note that at this time the timer counter is timing the current interval. This 
step may be omitted if all intervals are equivalent. 


4. Write the contents of the general-purpose register back into the Timer Reload 
Register. 


5. Test the general-purpose-register copy of the OV bit and, if it is set, report the error as 
appropriate. 


6. Perform any system operations required for the Timer interrupt. 
7. Execute an interrupt return. 


Timer Facility Uses 


Since the timer facility has a resolution of a single processor cycle, it may be used to 
perform precise timing of system events. For example, it may be used to determine an 
exact measurement of the number of cycles between two events in the system or to 
perform precise time-critical control functions. The Timer interrupt is enabled and 
disabled separately from other processor interrupts So its priority can be specified. 


The timer facility can be shared among multiple processes. This sharing is accomplished 
by the implementation of a queue for timer events, which are sorted in order of increas- 
ing event time. On each occurrence of a Timer interrupt, the TRV field is set for the 
interval between the next two events in the queue, while the Timer Counter Register is 
counting the current interval (because of a previous setting of the TRV field). The event 
at the beginning of the queue identifies other system actions to be taken for the Timer 
interrupt. This event is removed from the queue after the appropriate actions are taken. 


Timer Counter Register (TMC, Register 8) 


This protected special-purpose register (Figure 16- -13) contains the counter for the timer 
facility. 


Figure 16-13 Timer Counter Register 
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Bits 31-24: Reserved 


Bits 23-0: Timer Count Value (TCV)—The 24-bit TCV field decrements by one on each 
processor clock. When the TCV field decrements to zero, it is reloaded with the content 
of the Timer Reload Value field in the Timer Reload Register. At this time, the Interrupt 
bit in the Timer Reload Register is set. 
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The TCV field is zero-based with respect to the Timer interrupt interval; for example, a 
value of 28 in the TCV field causes the IN bit to be set in the 29th subsequent pioges2ot 
cycle. The TCV field is zero for a complete cycle before the IN bit is set. 


Timer Reload Reaister (TMR, Register 9) 


This protected special-purpose register (Figure 16-14) maintains synchronization of the 
Timer Counter Register, enables Timer interrupts, and maintains timer facility status 
information. 


Figure 16-14 Timer Reload aioe 
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Bits 31-27: Reserved 


Bit 26: Overflow (OV)—The OV bit indicates a Timer interrupt occurred before a previous 
Timer interrupt was serviced. It is set if the Interrupt (IN) bit is1 when the Timer Count 
Value (TCV) field of the Timer Counter Register decrements to zero. In this case, a Timer 
interrupt caused by the IN bit has not been serviced when another interrupt is created. 


Bit 25: Interrupt (IN)}—The IN bit is set whenever the TCV field decrements to zero. If 
this bit is 1 and the IE bit is also 1, a Timer interrupt occurs. The IN bit is set when the 
TCV field decrements to zero, regardless of the value of ie IE bit. The IN bit is reset by 


_ software that handles the Timer interrupt. 


Bit 24: Interrupt Enable (IE}—When the IE bit is 1, the Timer interrupt is enabled and 
the Timer interrupt occurs whenever the IN bit is 1. When this bit is 0, the Timer interrupt 
is disabled. The Timer interrupt may be disabled by the DA bit of the Current Processor 
Status Register regardless of the value of the IE bit. The Timer interrupt can also be 
disabled by the TD bit of the CPS Register, regardless of the value of IE and/or DA. 


Bits 23-0: Timer Reload Value (TRV)—The value of this field is written into the Timer 
Count Value (TCV) field of the Timer Counter Register when the TCV field decrements 
to zero. 


INTERNAL INTERRUPT CONTROLLER 


The various peripherals and controllers on the Am29200 and Am29205 microcontrollers 
can cause interrupts having the same effect on the processor as asserting the proces- 
sor’s INTR@ input. The interrupt controller provides a central location for generating 
interrupts, indicating which interrupts are active and permitting software to reset the 
interrupts independent of servicing the interrupting peripheral. 


Interrupt Control Register (ICT, Address 80000028) 


Bits of the Interrupt Control Register (Figure 16-15) are set at the leading edge of an 
interrupt condition, except for the bits related to the I/O port (in the IOPI field), since I/O 
port signals are independently configurable to generate edge-triggered interrupts. For 
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example, the DMAOI bit is set when the CTI bit transitions from 0 to 1 in the DMAO 
Control Register. When a bit in this register is 1, it causes an internal assertion of the 
processor’s INTR@ input (there is no external indication of this on INTR3). Software can 
inspect this register to determine the source of the interrupt and can reset bits in this 
register to clear the interrupt. 





Bits in the Interrupt Control Register are reset-only. Writing a 1 into a bit position causes 
the bit to be reset unless an interrupting condition becomes active at the same time, in 
which case the bit remains set. Writing a bit with 0 does not affect the bit, and the bit 
may be set by an interrupting condition at the same time the bit is written with 0. 


Figure 16-15 Interrupt Control Register 
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Bits 31-28: Reserved 


Bit 27: Video Interrupt (VDI)—A 1 in this bit indicates the video interface has generated 
an interrupt request. 


Bits 26—24: Reserved 


Bits 23-16: I/O Port Interrupt (IOPI)—A 1 in this field indicates the respective PIO 
signal has generated an interrupt request. A 1 in the most significant bit of the IOPI field 
indicates PIO15 has caused an interrupt, the next bit indicates PlO14 has caused an 


_interrupt, and so on. 


Bit 15: Reserved 


Bit 14: DMA Channel 0 Interrupt (DMAOI)—A 1 in this bit indicates DMA Channel 0 
has generated an interrupt request. 


Bit 13: DMA Channel 1 Interrupt (DMA1I)—A 1 in this bit indicates DMA Channel 1 
has generated an interrupt request. 


Bit 12: Reserved 


Bit 11: Parallel Port Interrupt (PPI)—A 1 in this bit indicates the parallel port has 
generated an interrupt request. 


Bits 10-8: Reserved 


Bit 7: Serial Port Receive Status Interrupt (RXSI)—A 1 in this bit indicates the serial 
port has generated an interrupt request because of the status of the receive logic: 


Bit 6: Serial Port Receive Data Interrupt (RXDI)—A 1 in this bit indicates the serial 
port has generated an interrupt request because receive data is ready. 


Bit 5: Serial Port Transmit Data Interrupt (TXDI)—A 1 in this bit indicates the serial 
port has generated an interrupt request because the Transmit Holding Register is empty. 


Bits 4—0: Reserved 
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Interrupt Controller Initialization 


Processor interrupts are disabled by a processor reset, but the Interrupt Control Register 
is not affected by a reset. To prevent spurious interrupts, software should reset all bits of 
the Interrupt Control Register to 0 before processor interrupts are enabled. 


Servicing Internal Interrupts | 
The Interrupt Control Register allows software to determine the source of an internal 


‘interrupt. Software can prioritize these interrupts using the processor’s Count Leading 


Zeros instruction. 


Software clears an interrupt by writing a 1 into the bit that is causing the interrupt 
(normally, the leading 1-bit in the Interrupt Contro! Register). For level-sensitive I/O port 
interrupts, the interrupting condition must be cleared and the corresponding PIO signal 
be in an inactive state before the Interrupt Control Register bit is cleared, otherwise 


_ another interrupt will be generated. 


For other types of interrupts, the condition causing the interrupt can be cleared in the 
interrupting peripheral independent of resetting the bit in the Interrupt Control Register, 
because the leading edge of the condition must be detected again before another 
interrupt can occur. However, the interrupt should not be cleared in a way that might lose 
the occurrence of a newly generated interrupt. Because the Interrupt Control Register is 
reset-only and because resetting a bit takes lower precedence than setting a bit, bits can © 
be reset without interfering with other interrupts or with the detection of a new interrupt of 
the type being cleared. 
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This chapter details the features of the Am29200 and Am29205 microcontrollers that 

. support debugging and testing. The chapter first describes the trace facility and instruc- 
tion breakpoints that aid in software debugging. Next, the test/development interface, the 
test access port, and the boundary-scan architecture are discussed. A description of 
how to use an Am29200 microcontroller to emulate an Am29205 microcontroller 
concludes the chapter. 


17.1 OVERVIEW 


The Am29200 microcontroller provides debugging and testing features at both the 
hardware and software levels. Instruction tracing and instruction breakpoints are 
supported. The microcontroller’s test development interface is composed of pins that 
indicate the state of the processor and control its operation. A JTAG-compliant test | 
access port facilitates system testing in a production environment. | 


The Am29205 microcontroller supports software debugging only. Hardware testing and 
debugging can be accomplished by using an Am29200 microcontroller to emulate an 
Am29205 microcontroller. 


17.2 TRACE FACILITY 


Software debug for the Am29200 and Am29205 microcontrollers is supported by the 
trace facility. The trace facility guarantees exactly one trap after the execution of any 

_instruction in a program being tested. This allows a debug routine to follow the execution 
of instructions and to determine the state of the Bese and system at the end of each 
instruction. 


Tracing is controlled by the Trace Enable (TE) and Trace Pending (TP) bits of the 
Current Processor Status Register. The value of the TE bit is always copied into the TP 
bit when an instruction enters the write-back stage of the processor pipeline. A Trace 
trap occurs whenever the TP bit is 1. As with most traps, the Trace trap can be disabled 
only by the DA bit of the Current Processor Status Register. 


In order to trace the execution of a program, the debug routine performs an interrupt 
return to cause the program to begin or resume execution. However, before the interrupt 
return is executed, the TE and TP bits of the Old Processor Status Register are set with 
the values 1 and 0, respectively. The interrupt return causes these bits to be copied into 
‘the TE and TP bits of the Current Processor Status Register. 


When the target instruction of the interrupt return (whose address is contained in the © 
Program Counter 1 Register when the interrupt return is executed) enters the write-back 
Stage, the processor copies the value of the TE bit into the TP bit. Since the TP bitis a 1, 
a Trace trap occurs. This trap prevents any further instruction execution in the target 
routine until the interrupt is taken and the routine is resumed with an interrupt return. 
When the Trace trap is taken, the TE and TP bits are both reset automatically, prevent- 
ing any further Trace traps. 


since the trace facility is managed by the Old and Current Processor Status registers, it 
operates properly in the event the processor takes an interrupt or trap—unrelated to the 
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trace facility—before the above trace sequence completes. When the unrelated interrupt 
or trap is taken, the state of the trace facility (i.e., the values of the TE and TP bits) is 
copied into the Old Processor Status Register from the Current Processor Status 
Register. The trace facility then resumes operation when the interrupted routine is 
restarted by an interrupt return. 


‘It is possible to cause a Trace trap by directly setting the TP and/or TE bits in the 


Current Processor Status Register. This may be accomplished only by a Supervisor- 
mode program. 


INSTRUCTION BREAKPOINTS 


The HALT instruction can be used as an instruction breakpoint by a hardware-develop- 
ment system. However, the HALT instruction normally is a privileged instruction, causing 


a Protection Violation trap upon attempted execution by a User-mode program. The 


hardware-development system can disable this Protection Violation trap as outlined in 
Section 17.7.5. 


The assert class of instructions and the Illegal Opcode trap can be used by software to 
implement instruction breakpoints. An instruction breakpoint is set by replacing an 
instruction with the assert instruction or an illegal opcode in the program under test. 
When the breakpoint instruction is encountered, the instruction breakpoint causes a trap. 
The illegal opcode is preferred since the Program Counter 1 (PC1) points to the illegal 
opcode when the trap is taken, whereas PC1 points to the instruction following the 
breakpoint if an assert instruction is used. 


PROCESSOR STATUS OUTPUTS (Am29200 MICROCONTROLLER) 


The STAT2-STATO outputs on the Am29200 microcontroller indicate certain information 
about processor modes along with information about processor operation. 
STAT2—STATO may be used to provide feedback of processor behavior during normal 
processor operation and when the processor is under the control of a hardware- 
development system. 


The encoding of STAT2—STATO is as follows: 


STAT2 STAT1 STATO Condition 


Halt or Step modes 
Interrupt/trap vector fetch (vector valid) 

~ Load Test Instruction mode, Halt/Freeze 
Branch target fetch (instruction valid) 
External data access (data valid) 
External instruction access (instruction valid) 
Internal peripheral access (data valid) 
Idle or data/instruction not valid © 


mht ot ot OOOO 
=a = OO ~~ &§ © OO 
=O +" O = Oo = ~ 


The STAT2-STATO outputs are a delayed indication of a mode or condition, so a mode 


or condition on a given cycle is reflected on STAT2—STATO on the second following 
cycle. For example, STAT2—STATO=100 indicates an external data access was valid on 
ID31—IDO at the end of the second previous cycle. The R/W output indicates the 
direction of an access, adding information about the access indicated on STAT2-STATO 
(the R/W signal appears with the access and is not delayed as are STAT2-STATO). If an 
access is extended by WAIT, the appropriate status is shown for every additional cycle 
until the access does complete. The address always appears on A23—A0, whether the 
access is a read or a write and whether the access is external or internal (that is, to an 
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internal peripheral). The data appears on 1D31-IDO, except on a read of an internal 
peripheral. 


17.5 CONTROL FIELD IN SCAN PATH (Am29200 MICROCONTROLLER) 


The Am29200 microcontroller incorporates a boundary-scan interface that is compatible 
with the IEEE 1149.1 JTAG specification. This interface permits access to testing 

and debug features of the processor core not visible on the external interface (see 
Section 17.6.2). 


A two-bit CNTL field appears on the Boundary Scan Register. The CNTL field corre- 
sponds to the CNTL1—-CNTLO pins that appear on other 29K family processors. This 
field can be used to halt, step, and start the processor, as well as force the processor to 
execute instructions for the purpose of testing and debugging. The CNTL field affects 
processor operations as follows: 


CNTL Value - Mode 
00 Load Test Instruction 
01 Step 
10 | Halt 
11 Normal 


Changes to the CNTL field are restricted so that only one bit of the CNTL field may 
change at any given time. The allowed transitions are shown in Figure 17-1. If these 
restrictions are violated, processor operation is unpredictable and a processor reset is 
required to resume predictable operation. 


Figure 17-1 Valid Transitions for CNTL Field 






Load Test 
Instruction 
00 
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Figure 17-2 


Because of the restrictions just described, it is not possible to transition directly between 
all possible modes controlled by the CNTL field. For example, the processor cannot go 
from the Load Test Instruction mode to Normal operation without first entering the Halt or 
Step modes. | 


TEST ACCESS PORT (Am29200 MICROCONTROLLER) 


The Am29200 microcontroller implements the Standard Test Access Port (TAP) and 
Boundary-Scan Architecture as specified by the IEEE Specification 1149.1—1990 
(JTAG), with the-exception that the INCLK pin is not part of the boundary-scan register. 
The IEEE 1149.1-—1990 Specification includes many details omitted from the discussion 
in this section and is included by reference. The following description discusses 
Am29200 microcontroller-specific considerations. 


Boundary-Scan Cells 


The test access port can access, affect, and sample the processor inputs and outputs © 
because a Boundary Scan Register (BSR) and Parallel Data Register (PDR) are 
incorporated into the design of the input and output cells. The Boundary Scan Register 
allows serial data to be loaded into or read out of the processor input/output boundary. 
The Parallel Data Register holds data stable at inputs and outputs ee scanning, so 
system signals are not adversely affected during scanning. 


An input or output cell incorporating a BSR and PDR register bit is referred to as a 
boundary-scan cell. This section describes the implementation of the Am29200 micro- 
controller boundary-scan cells. 


Figure 17-2 shows the design of an input boundary-scan cell, and Figure 17-3 shows the 
design of an output boundary-scan cell. Bidirectional signals use both of these designs 
in the same cell. Multiplexor selects, when active, select the lower multiplexor input. 


Input Boundary-Scan Cell 


Scan Output 
EXTEST, 
input Pin Lil 
xX M Input Signal 
U to Logic 
X . 
— fae 
M B 
U S 
X R 
. Shift Update 
Scan Input Clock Clock 


Note: For the CNTL field, the boundary-scan value is also selected by the ICTEST1 instruction. 
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Figure 17-3. Output Boundary-Scan Cell 


Scan Output 
EXTEST, Output 
INTEST © feet] Enable 
_ Output Signal | | 
from Logic M Output Pin 
x P. x 
X ZN 
SCAN pee 
M B 
U S 
X R 
/\ {\ 
Shift Update 
Clock Clock 


Scan Input 


The shift and update clocks, when used to sample or drive processor and system 
signals, are synchronized to the processor internal clocks so all signals (except the TAP 
signals) are sampled or driven synchronously to system clocks. However, the shift and 
update clocks still satisfy the JTAG constraints that inputs are sampled after the.rising 
edge of TCK, outputs change after the falling edge of TCK, and TCK is the only control 
needed to affect sampling and driving. 


The IEEE 1149.1-1990 Specification requires that it be possible to force the processor 
three-state outputs to be enabled. This is accomplished by cells that have no associated 
pin. The outputs of these cells force groups of output drivers to be enabled. Some 
outputs can be disabled by these cells even though the outputs cannot be disabled 
during normal operation (for example, the A23—A0 outputs can be disabled). 


The boundary-scan cells for the CNTL1—CNTLO field and STAT2—STATO outputs are 
part of the BSR and are accessible by scanning the BSR. However, they can also be 
scanned individually using the ICTEST1 instruction (see Section 17.6.2). If the ICTEST1 
instruction is active, no other boundary-scan cell is scanned. However, the contents of 
the other scan cells are undefined after this operation. 


The INCLK input is not a boundary-scan cell. The clocks to the processor must continue 
to operate even if the test access port is active. However, a fault on this input is readily 
visible in the operation of the test access port. 


The MEMCLK pin has an output boundary-scan cell. The EXTEST and INTEST 
instructions hold the MEMCLK signal at a fixed logic level for the duration of the 
instruction. 7 


Debugging and Testing 17-5 


cl AMD | 
17.6.2 


1 7.6.2.1 


17.6.2.2 


17-6 


Instruction Register and Implemented Instructions 


The Instruction Register (REG) of the test access port is a three-bit register. The least 
significant bit (IREGO) is the bit nearest the TDO output. Instructions are encoded as 
follows: 


IREG2 IREG1 IREGO Instruction 


0 0 0 EXTEST 

0 0 1 Preloaded value (acts like BYPASS) 
0 1 0. ICTEST2 | : 

0 1 1 Reserved (acts like BYPASS) 

1 0 0 INTEST 

1 0 1 SAMPLE 

1 1 0 ICTEST1 

1 1 1 BYPASS 


The EXTEST, BYPASS, INTEST, and SAMPLE instructions are specified by the 
1149.1—1990 Specification. Reserved instructions behave as BYPASS instructions to 
conform to the specification. ICTEST1 and ICTEST2 are AMD public instructions. 


Most of these instructions are described in detail in the IEEE 1149.1—1990 Specification. 
Below is a brief description of the special considerations in the Am29200 microcontroller 
implementation. 


EXTEST 


The EXTEST instruction is provided for external continuity and logic tests. It allows the 
test access port to drive outputs and sample inputs. 


EXTEST selects the Boundary Scan Register (BSR) for scanning. During execution 

1. Processor outputs are driven from the Parallel Data Register (PDR). | 

2. Processor internal output signals are sampled into the BSR. This is default behavior. 

3. Processor inputs are sampled into the BSR. | 

4. Processor internal input signals are driven from the PDR. This prevents internal logic 
from seeing invalid combinations of input signals possibly received from other chips 
during the test. | 

INTEST 


The INTEST instruction is provided to test the processor’s internal logic. Its primary 
value is to allow a hardware-development system to drive the processor's test interface 
without a direct electrical connection to all pins of the package. 


INTEST selects the BSR for scanning. During execution 


1. Processor outputs are driven from the PDR. This prevents external logic from seeing 
invalid combinations of output signals. 


2. Processor internal output signals are sampled into the BSR. 
3. Processor inputs are sampled into the BSR. This is default behavior. 
4. Processor internal input signals are driven from the PDR. | 
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The INTEST instruction allows the hardware-development system to alter and inspect 
internal registers, using processor load and store instructions, without having the 
external system see any bus activity. 


SAMPLE 


The SAMPLE instruction is provided to inspect the processor’s external signals without 
interfering with system operations. 


SAMPLE selects the BSR for scanning. During execution 


1. Processor outputs are driven by the processor. 

2. Processor internal output signals are sampled into the BSR. 

3. Processor inputs are sampled into the BSR. 

4. Processor internal input signals are driven from the processor inputs. 


ICTEST1 


The ICTEST1 instruction is defined for AMD processors using the extension mecha- 
nisms permitted by the IEEE 1149.1-1990 Specification. It is provided to drive the CNTL 
field and sample the STAT2-STATO outputs while leaving other inputs and outputs in 
their normal system connection. This allows a hardware-development system to control 


- the processor and system using the test access port. 


ICTEST1 selects a subset of the BSR for scanning. During execution 


1. Processor outputs are driven by the processor. 


2. Processor internal output signals are sampled into the BSR. This is default behavior 


for most signals, but allows the sampling of STAT2—STATO. | 
3. Processor input signals are sampled into the BSR. This is default behavior. 


4. The processor CNTL field is driven by the PDR. Processor internal inputs are driven 
from the processor inputs. 


ICTEST2 


The ICTEST2 instruction is defined for AMD processors using the extension mecha- 
nisms permitted by IEEE 1149.1-1990. ICTEST2 is similar to EXTEST with the excep- 
tion that the scan path for ICTEST2 excludes most of the processor outputs so the 
system is not disrupted (for example, by interfering with refresh). This allows a hard- 
ware-development system to access and modify processor internal state without 
disrupting the system. | 


1. Processor ID31-IDO and STAT2—STATO outputs are driven from the PDR. The output 
enable for the ID bus is controlled by the PDR. Other processor outputs are controlled 
by the processor. | 


2. Processor internal output signals for |D31-IDO and STAT2—STATO are sampled into 
the BSR. This allows a hardware-development system to sample the processor’s Sta- 
tus and data driven by the processor. 


3. Processor internal input signals for |D31-IDO are driven from the PDR. This allows a 
hardware-development system to provide data to the processor, independent of sys- 
tem controls. | 
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BYPASS 


The BYPASS instruction is provided to bypass the BSR and shorten access times to 
other devices at the board level. 


BYPASS selects the Bypass Register for scanning. The processor is not otherwise 
affected. 


Order of Scan Cells in Boundary-Scan Path 


This section documents the scan paths and the order of scan cells in the paths. The 
cells are listed in order from TDI to TDO. In the Am29200 microcontroller, there are five 
scan paths from TDI to TDO: 1) the instruction path, 2) the Bypass path, 3) the main 
data path, 4) the ICTEST1 path, and 5) the ICTEST2 pall 


instruction Path 


This three-cell path outlined in Table 17-1 is used to scan into the Instruction Register. 
When the instruction path is selected the captured data is always IREG2-IREGO = 001 
and the instruction is set by scanning. The preloaded pattern 001 is used to test for 
faults in the boundary-scan connections at the board level. The instructions are specified 
in Section 17.6.2. 


instruction Scan Path 


Bit Cell Name 
1 IREG2 

2 IREG1 

3 IREGO 
Bypass Path 


This is a one-cell path that is used to bypass the processor and shorten access to other 
devices at the board level. When the bypass path is selected, the captured data is 
always 0 and the scan-in data has no effect on the processor. 


Main Data Path 


This is a 188-cell path used to access the processor pins. This path is divided into five 
sets of cells. Where applicable, each set has a cell that enables the outputs of the set to 


be driven on the processor’s pins. These cells are not connected to a processor pin. For 


convenience, the drive enable cells are shown in Table 17-2 in boldface. Some of these 
cells affect outputs not normally enabled and disabled during normal system operation... 
The sets of cells are divided logically as follows: 1) clocks, requests, and reset, 2) 
miscellaneous peripheral control signals, 3) memory and peripheral controls, 4) instruc- 
tion/data bus. 
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Comments 


The MEMCLK scan cell is an output scan cell: 
it captures processor internal MEMCLK and 
substitutes the scanned value for the output. 


Enables the drivers for PSYNC through PWE 
PSYNC input 

PSYNC output 

VDAT input 

VDAT output 


- PIOO input 


PIOO output 
PIO1 input 


PIO1 output 


P1015 input 
P1015 output 


Enables the driving of the A23--A0 outputs 


Enables the drivers for DACKO through DSR 
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103 TR 

104 CASO 

105 CAS1 

106 CAS2 

107 CAS3 

108 RASO 

109 RAS1 

110 RAS2 

111 RAS3 

112 ROMOE 

113 RSWE 

114 BURST 

115 ROMCSO 

116 ROMCS1 

117 ROMCS2 

118 ROMCS3 

119 TXD 

120 DSR 

121 UCLK 

122 RXD 

123 DTR 

124 DBIDRV Enables the ID bus drivers 
125 IDIO IDO input 
126 [DOO IDO output 
127 IDI4 ID1 input 
128 IDO1 ID1 output 
187 © IDI31 ID31 input - 
188 IDO31 ID31 output 


Note: Drive-enable cells are shown in boldface. 
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ICTEST1 Path 


This five-bit path, outlined in Table 17-3, is used to provide quick access to the CNTL 
field and the STAT2—STATO output signals while keeping other inputs and outputs in 
their normal system connection. 


ICTEST1 Scan Path 


Bit Cell Name Comments . 

1 ~CNTLO Internal control field only 

2 CNTL1 

3 STATO Outputs: These signals are scanned out and are 

4 STAT1 shown on the TDO pin. The scan-in values do not 
5 STAT2 replace the processor output values. In ICTEST1, 


the processor outputs STAT2—STATO continue to 
reflect the internal processor signals. 


lf the ICTEST1 path is scanned, the contents of the shift register bits in the other scan 
cells become undefined. This occurs because all scan paths share the same shift clocks. 


ICTEST2 Path 


The ICTEST2 path includes only the ID bus, the CNTL field, and the STAT2—STATO 
signals. It is provided so a hardware-development system can access the processor 
without disrupting the system. 
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ICTEST2 Scan Path 


Bit Cell Name Comments 

1 CNTLO Internal control field only | 

2 CNTL1 

3 STATO Outputs: These signals are scanned out and are 

4 STAT1 shown on the TDO pin. The scan-in values do not 
5 STAT2 replace the processor output values. In ICTEST1, 


the processor outputs STAT2—STATO continue to 
reflect the internal processor signals. 


6 DBIDRV Enables the ID bus drivers 
7 IDIO IDO input 

8 IDOO IDO output 

9 IDI1 ID1 input 

10 IDO1 ID1 output 

69 IDI31 ID31 input 

70 IDO31 ID31 output 


IMPLEMENTING A HARDWARE-DEVELOPMENT SYSTEM 
(Am29200 MICROCONTROLLER) 


The Halt, Step, and Load Test Instruction modes of operation, invoked using the CNTL 
field in the boundary-scan path, are defined to support the debugging of the processor 
system by a hardware-development system (both hardware and software debug). This 
section describes the use of these modes during debug and describes the correspond- 
ing activity on the CNTL field and STAT2—STATO pins. 


Halt Mode 

The Halt mode allows the hardware-development system to stop processor operation 
while preserving its internal state. The Halt mode is defined so normal operation may 
resume from the point the processor enters the Halt mode. All external accesses are 

completed before the Halt mode is entered, so a minimum amount of system logic is | 

required to support the Halt mode. 


The Halt mode can be invoked by applying a value of 10 to the CNTL field. The proces- 
sor enters the Halt mode within two or three cycles after the CNTL field is changed 
(depending on synchronization time), except it first completes any external data access 
in progress. 


The Halt mode can also be entered as the result of executing a HALT instruction. When 
a HALT instruction is executed, the processor enters the Halt mode on the next cycle, 
except it completes any external data accesses in progress. In this case, the processor 
remains in the Halt mode even though the CNTL field is 11. However, the processor 
cannot exit the Halt mode except as the result of the CNTL field or RESET input. If the 
instruction following a Halt instruction has an exception (e.g., instruction mapping miss), 
the trap associated with the exception is taken before the processor enters the Halt 
mode. , 


The Halt instruction is designed as an instruction breakpoint by the hardware-develop- 
ment system. However, the Halt instruction is normally a privileged instruction, causing a 
Protection Violation trap upon attempted execution by a User-mode program. The 
hardware-development system can disable this Protection Violation as ger ee in 
Section 17.7. 5. 


In most cases, the STAT2—STATO outputs have a value of O00 whenever the processor 
is in the Halt mode. These outputs can be used to verify the processor its in Halt mode. 
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However, the STAT2—STATO outputs have a value of 010 if the Freeze (FZ) bit of the 
Current Processor Status Register is 1 when the Halt mode is entered. This indicates the 
visible registers do not reflect the current program state. 


While in the Halt mode, the processor does not execute instructions and performs no 
external accesses. The timer facility does not operate (i.e., the Timer Counter Register 
does not change). 


The Halt mode is exited when the Reset mode is ange or the CNTL field places the 
processor into another mode. The only valid transitions on the CNTL field from the value 
of 10 are to the value 00, which places the processor into the Load Test Instruction 
mode, or to the value 11, which causes the processor to resume normal execution. 


Step Mode 


The Step mode causes the Am29200 microcontroller to execute at a rate determined by 
the hardware-development system, allowing the hardware-development system to easily 


~ control and monitor processor operation. The Step mode is defined so normal operation 


may resume after stepping is complete. Since all external accesses are completed 
during any step, a minimum amount of system logic is required to support the slower 
rate of execution. 


The Step mode is invoked by the value of 01 in the CNTL field. The processor enters the 
Step mode within two or three cycles after the CNTL field is changed (depending on 
synchronization time), except it first completes any external data access in progress. 


In most cases, the STAT2—STATO outputs have a value of 000 whenever the processor 
is in the Step mode; these outputs can be used as a verification the processor is in Step 
mode. However, the STAT2—STATO outputs have a value of 010 if the Freeze (FZ) bit of 
the Current Processor Status Register is 1 when the Step mode is entered. This 
indicates the visible registers do not reflect the current program state. 


While in the Step mode, the processor does not execute instructions and performs no 
external accesses. The Timer Facility does not operate (i.e., the Timer Counter Register 
does not change) while the processor is in the Step mode. 


The Step mode is identical to the Halt mode in every respect except one. This difference 
is apparent on the transition of the CNTL field from the value 01 (Step mode) to the 
value 11 (Normal). On this transition, the processor steps. That is, the processor state 
advances by one pipeline stage, and it completes any external access that is initiated by 
this state change. 


If the processor immediately enters the Pipeline Hold mode on a step, the aiee may 
require multiple cycles to execute, since the processor pipeline cannot advance while 
the processor is in the Pipeline Hold mode. The STAT2-STATO lines reflect the state of 
the processor for every cycle of the step. 


The timer counter decrements by one for every cycle of the step; if the timer counter 
decrements to zero, the usual timer-facility actions are performed and a timer interrupt 
may occur. 


After the step is performed, the processor re-enters the Step mode and remains in the 


Step mode even though the CNTL field has the value 11 (this prevents the need fora 


time-critical transition on the CNTL field). The processor remains in this condition until the 
CNTL field transitions to 10 or 01 (or RESET is asserted). The transition to 10 causes the 
processor to enter the Halt mode and is used to clear the Step mode. The transition to 01 
causes the processor to remain in the Step mode so it may perform additional steps. 
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If the processor is placed in the Halt or Step mode while either a LOADM or STOREM 
instruction is being executed, the STAT2—-STATO outputs indicate the Halt or Step mode 
for one cycle (STAT2—STATO = 000). They then indicate the Pipeline Hold mode 
(STAT2—-STATO = 001) until the final access of the LOADM or STOREM is complete, at 
which time they return to indicating the Halt or Step mode. A hardware-development 
system must therefore ignore any single-cycle Halt/Step mode indication on the 
STAT2—STATO outputs as an indication the processor is halted. 


Load Test Instruction Mode 


The processor incorporates an Instruction Register (IR) that holds instructions while they 
are decoded. In the Load Test Instruction mode, the IR is enabled to receive the content 
of the instruction bus regardless of the state of the processor’s instruction fetcher. This 
allows the hardware-development system to provide instructions for execution directly, 
thereby providing means for the hardware-development system to examine and modify 
the internal state of the processor without altering the processor’s instruction stream. 


The hardware-development system can place an instruction in the IR by first placing 00 in. 
the CNTL field. The processor enters the Load Test Instruction mode within two or three 
cycles after the CNTL field is changed (depending on synchronization time). However, it 
first completes and terminates any established burst-mode instruction access. The Load 
Test Instruction mode can be entered only from the Halt or Step modes. 


When the processor enters the Load Test Instruction Mode, the processor behaves as 
though the Current Processor Status Register were forced to the value shown in 
Figure 17-4, even though the register is not changed (the value “u” means unaffected). 


Processor Status While in Load Test Instruction Mode 





0 

Reserved ' eee ' IMt 
TD ; IPY - EZ : WM ; SM , DA 

res TE TU res res DI 


The visible processor state remains unchanged while the processor is in the Load Test 
Instruction Mode. The processor status shown in Figure 17-4 remains in effect until the 
next transition to the Normal Mode via the Halt Mode. 


While the processor is in the Load Test Instruction mode, it ignores all interrupts and 
traps, except for the WARN trap. 


The STAT2-STATO lines have a value of 010 while the processor is in the Load Test 
Instruction mode; this may be used as a verification that the processor is loading the IR. 





While the processor is in the Load Test Instruction mode, the IR is continually storing the 
value on the instruction/data bus; any change in the value on this bus is reflected in the IR 
on the next cycle. The hardware-development system can place a desired instruction into 
the IR by driving this instruction on the Instruction/Data Bus or via the scan interface. 


The processor exits the Load Test Instruction mode in the second cycle following a 
change to the CNTL field. The only valid change here is either to the Halt mode 
(CNTL = 10) or the Step mode (CNTL = 01). 
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When the Load Test Instruction mode is exited, the most recent value stored into the IR 
is held. lf the processor is placed in the Step mode, the IR is marked as having valid 
content, enabling the processor to decode and execute the instruction. If the processor 
is placed in the Halt mode, it ignores any instruction placed in the IR by the Load Test 
Instruction mode and reverts to its normal instruction-fetch mechanism. 


Once the IR has been set by the Load Test Instruction mode, the instruction in the IR 
may be executed via the Step mode as discussed in the previous section. A single step 
is sufficient to cause the execution of this instruction. However, because of pipelining, 
multiple steps may be required before the instruction completes execution. If more than 
one step is performed, the processor executes the instruction in the IR on every step. If 
it is desired to step an instruction to completion without repeated execution, a NO-OP 
may be set into the IR (using the Load Test Instruction mode) after the first step. 


The Load Test Instruction mode may be used to cause the execution of most processor 
instructions (restrictions are discussed below). This allows inspection and modification of 
the processor state. 


Because of sequencing constraints, the Load Test Instruction mode cannot be used to 
cause the execution of the following instructions: conditional jumps, Load Multiple, Store 
Multiple, Interrupt Return, and Interrupt Return and Invalidate. Unconditional jumps and 
calls are permitted, but affect only the Program Counter. Instruction sequencing is not 
affected. : | 


The contents. of the Program Counter 0, Program Counter 1, Program Counter 2, 
Channel Address, Channel Data, Channel Control, and ALU Status registers are not 
updated while instructions are executed via the Load Test Instruction mode, except 
explicitly by Move To Special Register instructions. Instructions executed using the Load 
Test Instruction mode may access the protected processor state even though the 
processor is in the User mode. | 


Instructions executed via the Load Test Instruction mode may be used to access an 
external device or memory. Recall that the processor completes any normal data access 
before completing a step. This allows the processor to access devices and memories on 
behalf of the hardware-development system and simplifies the timing constraints on the 
hardware-development system. 


During processor execution via the Load Test Instruction mode, the processor retains 
the information required to resume normal operation. If any processor state is modified 
by the hardware-development system, this state must be restored properly for normal 
operation to resume properly. 


Once all instructions have been executed via the Load Test Instruction mode, the Halt 
mode (CNTL=10) prepares the processor to resume normal operation. When the CNTL 
field transitions to 11, the processor resumes normal operation. The sequence for the 
CNTL field to clear the Load Test Instruction mode and resume normal operation is thus 
00/10/11. 


Accessing Internal State Via Boundary Scan 


The hardware-development system uses load and store instructions, executed via the 
Load Test Instruction mode, to alter and inspect the contents of general-purpose registers. 
The OPT field for these loads and stores have the value 110 and are directed to the ROM 
address space (for example, address 0): this causes the processor to prevent the 
resulting access from appearing in the system. The access is visible only via the bound- 
ary-scan register. Furthermore, it causes the Am29200 microcontroller to ignore the 
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generation of wait states: the access completes at the end of the next stepped instruction. 
This provides a means for a hardware-development system to perform accesses. 


It is not possible to execute a load directly following a store, nor a store directly following 
a load, using the Load Test Instruction mode. At least one NO-OP (or other operation) 
must be executed between adjacent loads and stores, because of control conflicts that 
arise when these instructions are stepped in a system that performs the resulting 
accesses at normal speed. However, a sequence of only loads or only stores is per- 
mitted without restriction. 


This section describes the. sequence of boundary-scan operations panos to access 
processor internal state. 


Altering State Via Boundary Scan — 


A hardware-development system uses load instructions to alter the contents of general- 
purpose registers. Since the contents of general-purpose registers can be moved to 
special-purpose registers, this provides a means to alter other state as well as the 
values in general-purpose registers. 


With the processor in the Halt mode, the hardware-development system uses the 


- following sequence to modify the value in a general-purpose register: 


1. Set the CNTL field to 10 (Halt) using the ICTEST1 boundary-scan instruction. 
2. Set the CNTL field to 00 (Load Test Instruction) using the ICTEST1 instruction. 


3. Using the ICTEST2 instruction, set the IDI31-IDIO cells with an instruction to load the 
desired register from the ROM address space, with OPT=110, and set the CNTL field 
to 01 (Step). This places the load instruction into the IR and prepares the processor 
to step. 


4, Using the ICTEST1 instruction, sequence the CNTL field through the values 11, 01, 
and 00 (Normal, Step, and Load Test Instruction). This steps the processor and pre- 
pares it to receive another instruction. 


5. Using the ICTEST2 instruction, set the IDI31-IDIO cells to 70400101, hexadecimal 


(NOOP), and set the CNTL field to 01. This loads a NO-OP into the IR. 


6. Using the ICTEST2 instruction, set the IDI31—IDIO cells to the value to be loaded and 
set the CNTL field to 11. This steps the processor and applies the value to be loaded 
into the register. 


7. Set the CNTL field to 01 using the ICTEST1 instruction. 
8. Repeat steps 2 through 7 for the remaining registers. 


Inspecting State Via Boundary Scan 


A hardware-development system uses store instructions to inspect the contents of 
general-purpose registers. Since the processor internal state can be moved to general- 
purpose registers, this provides a means to inspect other states as well as the values in 
general-purpose registers. 


With the processor in the Halt mode, the hardware-development system uses the 
following sequence to retrieve the value in a general-purpose register: 


1. Set the CNTL field to 10 (Halt) using the ICTEST1 boundary-scan instruction. 
2. Set the CNTL field to 00 (Load Test Instruction) using the ICTEST1 instruction. 
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. Using the ICTEST2 instruction, set the IDI(31-0) cells with an instruction to store the 


desired register into the ROM address space, with OPT=110, and set the CNTL field 
to 01 (Step). This places the store instruction into the IR and prepares the processor 
to step. 


. Using the ICTEST1 instruction, sequence the CNTL field through the values 11, 01, 


and 00.(Normal, Step, and Load Test Instruction). This steps the processor and pre- 
pares it to receive another instruction. 


. Using the ICTEST2 instruction, set the IDI31-IDIO cells to 70400101, hexadecimal 
_(NO-OP), and set the CNTL field to 01. This loads a NO-OP into the IR. 


. Set the CNTL field to 11, then back to 01 using the ICTEST1 instruction. This steps 


the processor. At the end of the step, the contents of the register are on the ID bus, 
and may be obtained in the Capture-DR state of the TAP controller (this state is de- 
scribed in the IEEE 1149.1-1990 Specification). The value will be held on the ID bus 
until the next step. 


Repeat steps 2 through 6 for the remaining registers. 


17.7.5 HALT Instructions as Breakpoints 


The HALT instruction can be used by a hardware-development system to implement an 
instruction breakpoint. To use the HALT instruction as an instruction breakpoint, the 
hardware-development system must disable the protection checking that normally 
applies to the HALT instruction so the HALT does not cause a Protection Violation trap. 
To accomplish this, the hardware-development system must perform a special sequence 
of operations on the boundary-scan interface. This sequence is similar in effect to 
holding the CNTL1—CNTLO inputs at 10 during a reset on other 29K Family processors. 
The special sequence is needed in the Am29200 microcontroller because it has no 
CNTL1—CNTLO inputs, but rather implements a CNTL field in the boundary-scan 
register. The following sequence disables protection checking on the HALT instruction: 


1. 
2. 
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Set the CNTL field to 10 using the ICTEST1 JTAG instruction. 


Reset the boundary-scan cells RESET, DBIDRV, BOTDRV, ABIDRV, and TOPDRV to 
0 using the INTEST instruction. If the boot ROM in Bank 0 is 8 or 16 bits wide, reset 
the BOOTW cell to 0. If the boot ROM is 32 bits wide, set the BOOTW cell to 1. This 
resets the processor. 





. Set the RESET cell to 1 using the INTEST instruction. If the boot ROM is 8 bits wide, 


set the BOOTW cell to 1 (otherwise leave it at 0 or 1). This takes the processor out of 
reset and configures the boot ROM. 


. Set the CNTL field to 00 using the ICTEST1 instruction. 
. Using the ICTEST2 instruction, set the IDI31-IDI0 cells to a1000000, hexadecimal, 


and set the CNTL field to 01. This loads a jump to address 0 into the IR and prepares 
the processor to step. 


. Using the ICTEST1 instruction, sequence the CNTL field through the values 11, 01, 


and 00. This steps the processor and prepares it to receive another instruction. 


. Using the ICTEST2 instruction, set the IDI31—IDIO cells to 70400101, hexadecimal. 


This loads a NO-OP into the IR. 


. Using the ICTEST1 instruction, sequence the CNTL field through the values 11, 01, 


and 11. This starts the processor with protection checking disabled-on the HALT 
instruction. The TMS input must be kept High during this operation. 
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Forcing Outputs to High Impedance 


For the Am29200 microcontroller, a hardware-development system can force processor 
outputs to the high-impedance state by asserting the GREQ input during a processor 
reset. The outputs remain in the high-impedance state until a processor reset during 
which GREQ is not asserted. This supports functions such as replacing chip signals by 
emulator signals. 





For the Am29205 microcontroller, a hardware-development system.can force processor 
outputs to high impedance by asserting the WAIT/TRIST signal. The outputs remain in 
the high-impedance state gn a processor reset during which WAIT/TRIST is not 
asserted. 











EMULATING THE Am29205 MICROCONTROLLER 


Unlike the Am29200 microcontroller, the Am29205 microcontroller does not have a 
hardware-development interface for testing and debugging. Hardware development is 
performed by using an Am29200 microcontroller to emulate an Am29205 microcontroller. 


Figure 17-5 shows a typical configuration for using an Am29200 microcontroller to 
emulate the Am29205 microcontroller. The hardware-development system asserts the 
WAIT/TRIST pin during processor reset to force the outputs of the Am29205 microcon- 
troller into high impedance state. An Am29200 microcontroller is then connected to the 
Am29205 microcontroller and interfaced directly to the system logic. The hardware-de- 
velopment system then uses the JTAG to access the CNTL field on the Am29200 
microcontroller and perform the hardware debugging functions previously described in 
this chapter. 





To avoid contention on the WAIT TRIST signal, the target system must drive the 
WAIT/TRIST pin with an open collector buffer (e.g., 74HCO7) and a pull-up resistor. The 
hardware-development system must continue to drive the WAIT/TRIST pin Low for at 
least one MEMCLK cycle following the deassertion of RESET So the correct value on the 
WAIT/TRIST pin is latched into the internal logic. 


On the Am29200 microcontroller, the memories (ROM and DRAM) are attached to 
ID31-ID16 and the peripherals (PIA and Parallel Port Data Register) are attached to 














‘1D15-IDO. On the Am29205 microcontroller, the memories and peripherals are attached 


to 1ID31-1D16. For the Am29200 microcontroller to access the memories and peripherals 
correctly, ID31-ID16 on the Am29200 microcontroller must be connected to ID15-IDO. 
To avoid contentions on the ID bus, three-state buffers (e.g., 2 x 74HC244) are used to 
isolate ID31-ID16 from ID15—IDO. These three-state buffers will be enabled when the 
processor is doing a PIA or parallel port read access. 
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Figure 17-5 Using an Am29200 Microcontroller to Emulate an Am29205 Microcontroller 
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CAS1-CAS0, TR/OE, 
DACKO, GACK, 
PIACS5-PIACS2, BURST, 
DSR, PIO7-PIOO 
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This chapter provides a specification of the instruction set for the Am29200 and 
Am29205 microcontrollers. Sections 18.1 and 18.2 describe the terminology and the 
instruction formats. Section 18.3 describes each instruction in detail; instructions are 
presented alphabetically by assembler mnemonic. Finally, Section 18.4 gives an index of 
instructions by operation code. 


INSTRUCTION-DESCRIPTION NOMENCLATURE 


To simplify the specification of the instruction set, special terminology is used throughout 
this chapter. This section defines the terminology and symbols used to describe 
instruction operands, operations, and the assembly-language syntax. | 


This section does not describe all terminology used. It excludes certain descriptive terms 
with obvious meanings. 


Operand Notation and Symbols 


Throughout this chapter, instruction operands are signed two’s-complement word 
integers unless otherwise noted. The term “register” is used consistently to denote a 
general-purpose register. Other types of registers are described explicitly. 


The following notation is used in the description of instruction operands: 


0116 | 16-bit immediate data, zero-extended to 32 bits 

1116 _. 16-bit immediate data, one-extended to 32 bits 

116 16-bit immediate data 

BP The Byte Pointer (BP) field of the ALU Status Register. The BP field 
_ selects a byte or half-word within a word. | 

C The Carry (C) bit of the ALU Status Register. The C bit is logically 

zero-extended to 32 bits when involved in a word operation. 
COUNT The value of the Count Remaining field of the Channel Control Reg- 


ister. Note that COUNT does not refer to this field directly, but rather 
to the value of the field at the beginning of aLOADM or STOREM 


instruction. 

DEST The general-purpose register that is the destination of an instruction 
(i.e., the register used to store the result). 

EXTERNAL The word in an external device or memory with address n. 

WORD[n] | | 3 

FALSE The Boolean constant FALSE 

FC The Funnel Shift Count (FC) field of the ALU Status Register 

h'n’ The hexadecimal constant n | 

IPA Indirect Pointer A Register 

IPB | Indirect Pointer B Register 
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IPC 
PC 


Q 


Register RA 
Register RB 
Register RC 


_SPDEST 


SPECIAL 


Special-purpose 
Register SA 


SRCA 
SRCB 


SRCA.BYTEn 
SRCB.BYTEn 


TARGET 


TRUE 
TWIN 


A<<B 
A>>B 


Ali B 


A&B 
A|IB 
AB 
~A 
A< exp 
A=B 
A<>B 


Indirect Pointer C Register 


Program Counter Register. This register is not explicitly 

accessible by instruction, but does appear as an operand for 
certain instructions. The Program Counter always contains the word 
address of the instruction being executed, and is 30 bits in length. 


Q Register 
These designate the general-purpose registers specified by the 


instruction fields RA, RB, and RC (see Section 18.2). 


The special-purpose register that is the destination of an instruction. 


The contents of a special-purpose register, used as an 
instruction operand. 


Designates the special-purpose register specified by the 
instruction field SA (see Section 18.2). 


The contents of general-purpose registers, used as instruction 
operands. 


Designate the byte numbered n within the SRCA or SRCB 
operand. 


The target-instruction address specified by a jump or call instruc- 
tion. This address is either absolute or Program-Counter relative. 


Boolean constant TRUE 


General-purpose registers are paired by absolute-register 
numbers, such that even-numbered registers are paired with odd- 
numbered registers having the next-highest register number. The 
twin of a given register is the other register in the pair to which the 
given register belongs. For example, Local Register 5 is the twin of 
Local Register 4, and vice versa. | 


: Operator Symbols 
The following symbols are used to describe instruction operations: 


Left shift of the A operand by the shift amount given 
by the B operand | 


Right shift of the A operand by the shift amount given — 
by the B operand 


Concatenation. The B operand is appended to the A operand. In the 
resulting quantity, the A operand makes up the high-order part, and 
the B operand makes up the low-order part. 


Bitwise AND 

Bitwise OR 

Bitwise exclusive-OR 
One’s-complement 


Assignment of the A location by the result of the expression on the 
right side 


Equal to 
Not equal to 
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A>B 
A>B 
A<B 
A<B 
A+B 
A-B 
A*B 
A/B 

A..B 


AORB 
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Greater than 

Greater than or equal to 

Less than 

Less than or equal to 

Addition 

Subtraction 

Multiplication 

Division | 

A subrange that includes the A operand and the B operand. This 
symbol is used for subranges of bits as well as subranges of words. 
Logical OR of two Boolean conditions 


Control-Flow Terminology . 


The following terminology is used to describe the control functions performed during the 
execution of various instructions: | 


Continue 


IF condition 
THEN operations 
ELSE operations 


Signed overflow 
Trap(n) 


Unsigned 
overflow 


Unsigned 
underflow 


VN 


Continue execution of the current instruction sequence. 


The condition following the IF is tested. If the condition holds, the 
operations following the THEN are performed. If the condition 
does not hold, the operations following the ELSE are performed. If 
the ELSE is not present and the condition does not hold, no opera- 
tion is performed. 


This condition is present when the result of an add or subtract of 
two’s-complement operands cannot be represented by a signed 
word integer. 


Specifies a trap with vector number n. The vector number n may be 
specified indirectly (e.g., Trap (VN)) or explicitly by symbolic name 
(e.g., Trap (Out-of-Range)). | 

This condition is present when the result of an add of unsigned 
operands cannot be represented by an unsigned word integer. 


This condition is present when the result of a subtract of 
unsigned operands cannot be represented by an unsigned 
integer (i.e., when the result is less than zero). 


Designates the trap vector number specified by the instruction field 
VN (see Section 16.3.2). 


Assembler Syntax 


This chapter does not contain a full description of the instruction assembler, but 
provides a rudimentary description of the assembler syntax. 


cntl 
const8 
const16 


_ The following notation is used to describe assembler tokens: 


Determines the 7-bit control field in a load or store instruction. 
Specifies a constant that can be expressed by 8 bits. 
Specifies a constant that can be expressed by 16 bits. 
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Figure 18-1 
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fa These tokens name general-purpose registers. In a formal 
rb sense these represent the same token since the name of a 
rc _. register does not depend on its instruction use. However, three dis- 


tinct tokens are used to clarify the relationship between the assem- 
bler syntax, instruction operands, and instruction fields. 


spid | A symbolic identifier for a special-purpose register. 


target A symbolic label for the target of a jump or call instruction. 
vn Specifies a trap vector number. 


INSTRUCTION FORMATS 


All instructions for the Am29200 and Am29205 microcontrollers are 32 bits in length and 
are divided into four fields, as shown in Figure 18-1. These fields have several alterna- 


tive definitions, as discussed below. In certain instructions, one or more fields are not 


used, and are reserved for future use. Even though they have no effect on processor 
operation, bits in reserved fields should be 0 to insure compatibility with future processor 
versions. 


Instruction Format 


M 117... 110 SA RB or! 
115 ...18 19... 12 
VN 17... 10 
CNTL UI //RND //FD// FS 
Reserved // FS 


The instruction fields are defined as follows: 


Bits 31-24 

OP This field contains an operation code that defines the operation to 
be performed. In some instructions, the least significant bit of the 
operation code selects between two possible operands. For this 

_ reason, the least significant bit is sometimes labeled A or M with the 
following interpretations: 

A (Absolute): The A bit is used to differentiate between Program- 
Counter relative (A = 0) and absolute (A = 1) instruction addresses 
when these addresses appear within instructions. 

M (Immediate): The M bit selects between a register operand (M = 0) 
and an immediate operand (M = 1) when the alternative is allowed 
by an instruction. 

Bits 23-16 

RC The RC field contains a global or local register number. 
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117... 110 


115... 18 


VN 
CNTL 


Bits 15-8 


RA 


SA 
Bits 7—0 


RB 


RB or | 


17... lO 


AMD de | 


This field contains the most significant eight bits of a 16-bit 
instruction address. This is a word address and may be 
program-counter relative or absolute depending on the A bit of the 
operation code. 


This field contains the most significant eight bits of a 16-bit 
instruction constant. 


This field contains an 8-bit trap vector number. 


This field controls a load or store access as described in 
Section 3.3.1 | 


The RA field contains a global or local register number. 
The SA field contains a special-purpose register number. 


The RB field contains a global or local register number. 


This field contains either a global or local register number, or an 
8-bit instruction constant depending on the value of the M bit of the 
operation code. 


This field contains the least significant eight bits of a 16-bit instruc- 
tion address. This is a word address and may be program-counter 
relative or absolute depending on the A bit of the operation code. 


This field contains the least significant eight bits of a 16-bit | 
instruction constant. | 


Ul // RND //FD//FS This field controls the operation of the CONVERT instruction. 


~ reserved // FS 


This field is the FS portion of the above field and specifies the 
operand format for the CLASS and SQRT instructions. 


The fields described above may appear in many combinations. However, certain 
combinations that appear frequently are shown in Figure 18-2. 


INSTRUCTION DESCRIPTION 


This section describes each microcontroller instruction in detail, Figure 18-3 illustrates 
the layout of the information given for each description. : 


Instruction Set | 18-5 


zt AMD 
Figure 18-2 Frequently Occurring Instruction Field Uses 


Three operands with possible 8-bit constant: 


31 23 15 7 0 
a 


Three operands without constant: 


XXX XXX XM 


31 23 15 7 | 0 
= om fe 


One register operand with 16-bit constant: 


KOKI 0 


: 


31 23 15 7 0 


Jumps and calls with 16-bit instruction address: 


31 23 15 7 


XxX XXXXXA 117... 10 


Two operands with trap vector number: 
31 23 


Loads and stores: 


oO 


I9..12 


RB or! 


on a 
D 
> > 


N N 
oO © 


31 23 
XX XX XX XM CNTL ~ RA RB or] 
Res 
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Figure 18-3 Instruction-Description Format 


Instruction 

Mnemonic 

Instruction 

Name 

ot Operation Operation: 
Description 

Assembler ee 
Syntax ——————__——_> yntan: 
Arithmetic/Logic 

Status Result Status: 
Operand Specification— Operands: 


Describes the 
instruction fields’ 
relations to operands, 
and implicit operands 
in some cases 


instruction Format— 31 


AMD ct 


Add 


DEST - SRCA + SRCB 


ADD re, ra, rb 
or 
ADD re, ra, const8 


V, N, Z, C 


SRCA Content of register RA 


SRCB M = 0: Content of register RB 
M = 1: 1 (Zero-extended to 32 bits) 


DEST Register RC 


23 15 7 0 


Specifies field , 


Operation Code— OP = 14,15 


HEX format - 


Detailed Description 
of instruction 
operation 





ADD 


Description: The SRCA operand is added to the SRCB 


operand and the result is placed into the 
DEST location. 
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ADD 


Operation: 


Assembler 
Syntax: 


Status: 


Operands: 


31 


~ADD 
Add 
DEST «+ SRCA + SRCB 
ADD re, ra, rb © 
or 
ADD re, ra, const8 
V,N, Z, C 
SRCA Content of register RA 
SRCB ‘M = 0: Content of register RB 
M = 1: | (Zero-extended to 32 bits) 
DEST Register RC 
23 | 15 7 0 


OP = 14, 15 ; ADD 


Description: 
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The SRCA operand i is added to the SRCB operand and the result | is 
placed into the DEST location. 
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~ ADDC | ADDC 
Add with Carry | 


Operation: DEST — SRCA+SRCB+C 


Assembler 
Syntax: ADDC rc, ra, rb 
or 
ADDC re, ra, const8 


Status: V,N,Z,C 


Operands: SRCA Content of register RA 
_SRCB M = 0: Content of register RB 
M = 1: | (Zero-extended to 32 bits) 
DEST — Register RC 
31 | 23 7 15 . 7 0 
OP = 1C, 1D ADDC 


Description: The SRCA operand is added to the SRCB operand and the value of the 
ALU Status Carry bit, and the result is placed into the DEST location. 
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ADDCS a ADDCS 
Add with Carry, Signed 


Operation: DEST << SRCA+SRCB+C 
IF signed overflow THEN Trap (Out of Range) 


Assembler 
Syntax: ADDCS rc, ra, rb 
or | 
ADDCS re, ra, const8 


Status: V,N,2Z,C 


Operands: SRCA Content of register RA 
| SRCB M = 0: Content of register RB 
M = 1: | (Zero-extended to 32 bits) 
DEST Register RC 
«31 23 15 7 0 
OP = 18, 19 | ADDCS 


Description: The SRCA operand is added to the SRCB operand and the value of the 
ALU Status Carry bit, and the result is placed into the DEST location. If 
the add operation causes a two’s-complement signed overflow, an 
Out-of-Range trap occurs. 


Note that the DEST location is altered whether or not an overflow 
occurs. 
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~ADDCU > 
Operation: 
Assembler 
Syntax: 
Status: 


Operands: 


31 


AMD \ 


ADDCU 

Add with Carry, Unsigned 
DEST.<— SRCA + SRCB + C 
IF unsigned overflow THEN Trap (Out of Range) 
ADDCU re, ra, rb 

or | 

ADDCU re, ra, const8 
V, N, Z, C 
SRCA Content of register RA 
SRCB > M = 0: Content of register RB 

~M=1: I (Zero-extended to 32 bits) 
DEST Register RC 

23 | 15 7 0 


OP=1A,1B _ ADDCU 


Description: 


The SRCA operand is added to the SRCB operand and the value of the 
ALU Status Carry bit, and the result is placed into the DEST location. If 
the add operation causes an unsigned overflow, an Out-of-Range trap 


- OCCUIS. 


Note that the DEST location is altered whether or not an overflow 
occurs. : | 
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| | ADDS ~ | ADDS. 
Add, Signed 
Operation: DEST <— SRCA+SRCB 
| IF signed overflow THEN Trap (Out of Range) 


Assembler 
Syntax: ADDS re, ra, rb 
or 
ADDS re, ra, const8 


Status: V,N,Z,C 


Operands: SRCA Content of register RA 
SRCB M = 0: Content of register RB 
M = 1: | (Zero-extended to 32 bits) 
DEST Register RC 
31. 23 15 | 7 , 0 
OP = 10, 11 ADDS | 


Description: The SRCA operand is added to the SRCB operand and the result is 
placed into the DEST location. If the add operation causes a 
two’s-complement signed overflow, an Out-of-Range trap occurs. 


Note that the DEST location is altered whether or not an overflow 
occurs. 
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ADDU 


Operation: 


Assembler 
Syntax: 


Status: 
Operands: 


31 


AMD Pa | 
ADDU 


Add, Unsigned 
DEST — SRCA + SRCB 
IF unsigned overflow THEN Trap (Out of Range) 
ADDU rc, ra, rb 
or 

ADDU rc, ra, const8 
V,N,Z,C 
SRCA Content of register RA 
SRCB M = 0: Content of register RB 

M = 1: | (Zero-extended to 32 bits) 
DEST Register RC 

23 15 7 | 0 


OP = 12, 13 | ADDU 


Description: 


The SRCA operand is added to the SRCB operand and the result is 
placed into the DEST location. If the add operation causes an unsigned 
overflow, an Out-of-Range trap occurs. — 


Note that the DEST location is altered whether or not an overflow 
occurs. 
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AND AND 
AND Logical 
Operation: DEST < SRCA & SRCB 
Assembler 
Syntax: AND re, ra, rb 
or 
AND re, ra, const8 
Status: N,Z 
Operands: SRCA Content of register RA 
SRCB M = 0: Content of register RB 
M = 1: | (Zero-extended to 32 bits). 
DEST Register RC | 
31 | 23 15 7 0 
OP = 90, 91 AND 


Description: The SRCA operand is logically ANDed, bit-by-bit, with the SRCB 
operand and the result is placed into the DEST location. 
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ANDN 


Operation: 


Assembler 
Syntax: 


Status: 
Operands: 


31 


AMD cl 


| ANDN 
AND-NOT Logical 
DEST <— SRCA & ~SRCB » 
ANDN re, ra, rb 
or 
ANDN re, ra, const8 
N,Z 
SRCA Content of register RA 
SRCB M = 0: Content of register RB 
M = 1: | (Zero-extended to 32 bits) 
DEST Register RC 
23 15 7 —— 0 


OP = 9C, 9D 


Description: 


ANDN 


The SRCA operand is logically ANDed, bit-by-bit, with the 
one’s-complement of the SRCB operand and the result is placed into 


_ the DEST location. 
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ASEQ ASEQ 
Assert Equal To 
Operation: IF SRCA=SRCB THEN Continue 
ELSE Trap (VN) 
Assembler | 
Syntax: ASEQ vn, ra, rb 
or 
ASEQ vn, ra, const8 
Status: Not affected 
Operands: SRCA Content of register RA 
SRCB M = 0: Content of register RB 
M = 1: | (Zero-extended to 32 bits) 
VN | Trap vector number 
31 23 15 7 0 
0111000M VN RA RB or | 
OP = 70, 71 ASEQ 
Description: If the SRCA operand is equal to the SRCB operand, instruction 


18-16 


execution continues; otherwise, a trap with the specified vector number 
occurs. 


For programs in the User mode, a Protection Violation trap 
occurs—instead of the assert trap—if a vector number between 0 and 
63 is specified. 
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ASGE : | ASGE 
Assert Greater Than or Equal To | 


Operation: IF SRCA >SRCB THEN Continue 


, ELSE Trap (VN) 
Assembler 
Syntax: ASGE vn, ra, rb 
or 


ASGE vn, ra, const8 
Status: Not affected 


Operands: SRCA Content of register RA 
SRCB = #£.M=0O: Content of register RB 
M = 1: | (Zero-extended to 32 bits) 
VN Trap vector number 
31 23 15 7 ) 


OP =5C, 5D ASGE 


Description: _ if the value of the SRCA operand is greater than or equal to the value of 
the SRCB operand, instruction execution continues; otherwise, a trap 
with the specified vector number occurs. 


For programs in the User mode, a Protection Violation trap 
occurs—instead of the assert trap—if a vector number between 0 and 
63 is specified. 
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ASGEU | | ASGEU 
Assert Greater Than or Equal To, Unsigned 


Operation: IF SRCA > SRCB (unsigned) THEN Continue 


ELSE Trap (VN) 
Assembler 
Syntax: ASGEU vn, ra, rb 
or 


ASGEU vn, ra, const8 
Status: Not affected 


Operands: SRCA Content of register RA 
SRCB M = 0: Content of register RB 
M = 1: | (Zero-extended to 32 bits) 
VN Trap vector number 
31 | 23 15 7 0 
OP = 5E, 5F ASGEU | | 


Description: _ If the value of the SRCA operand is greater than or equal to the value of 
| the SRCB operand, instruction execution continues; otherwise, a trap 
with the specified vector number occurs. For the comparison, both 
operands are treated as unsigned integers. 


For programs in the User mode, a Protection Violation trap : 
- occurs—instead of the assert trap—if a vector number between 0 and 
63 is specified. 
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ASGT 


Operation: 


Assembler 
Syntax: 


Status: 
Operands: 





31 23 15 7 0 


AMD ct 


ASGT 
Assert Greater Than 


IF SRCA > SRCB THEN Continue 
ELSE Trap (VN) 


ASGT vn, ra, rb 


ASGT vn, ra, const8 
Not affected 
SRCA Content of register RA 
SRCB M = 0: Content of register RB 
M =1: | (Zero-extended to 32 bits) 


VN Trap vector number 





OP = 58, 59 ASGT 


Description: 


If the value of the SRCA operand is greater than the value of the SRCB 
operand, instruction execution continues; otherwise, a uae with the 
specified vector number occurs. 


For programs in the User mode, a Protection Violation trap 
occurs—instead of the assert trap—if a vector number between 0 and 
63 is specified. 
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ASGTU 


Operation: 


Assembler 
Syntax: 


Status: 
Operands: 


31 


ASGTU 
Assert Greater Than, Unsigned 


IF SRCA > SRCB (unsigned) THEN Continue 
ELSE Trap (VN) 


ASGTU vn, ra, rb 
or 
ASGTU vn, ra, const8 


Not affected 
SRCA Content of register RA 


SRCB M = 0: Content of register RB 
M = 1: | (Zero-extended to 32 bits) 


VN Trap vector number 


23 15 a 0 


Description: 
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If the value of the SRCA operand is greater than the value of the SRCB 
operand, instruction execution continues; otherwise, a trap with the 
specified vector number occurs. For the comparison, both operands are 
treated as unsigned integers. 


For programs in the User mode, a Protection Violation trap 
occurs—instead of the assert trap—if a vector number between O and 
63 is specified. 
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Operation: 


Assembler 
Syntax: 


Status: 


Operands: | 


31 


AMD at 
- ASLE 
Assert Less Than or Equal To 


IF SRCA < SRCB THEN Continue 
ELSE Trap (VN) 


ASLE vn, ra, rb 
or 
ASLE vn, ra, const8 


Not affected 
SRCA - Content of register RA 


SRCB M = 0: Content of register RB 
M = 1: I (Zero-extended to 32 bits) 


VN Trap vector number 


23 15 7 : 0 


OP = 54, 55 ; ASLE 


Description: 


If the value of the SRCA operand is less than or equal to the value of 
the SRCB operand, instruction execution continues; otherwise, a trap 
with the specified vector number occurs. 


For programs in the User mode, a Protection Violation trap 
occurs—instead of the assert trap—if a vector number between 0 and 
63 is specified. 


Instruction Set | 18-21 


al AMD 
ASLEU 


Operation: 


Assembler 
Syntax: 


Status: | 


Operands: 


31 


ASLEU 
Assert Less Than or Equal To, Unsigned 


IF SRCA < SRCB (unsigned) THEN Continue 
ELSE Trap (VN) 


ASLEU vn, ra, rb 
or 
ASLEU vn, ra, const8 


Not affected . 
SRCA Content of register RA 


SRCB M = 0: Content of register RB 
M = 1: | (Zero-extended to 32 bits) 


VN Trap vector number 


23 15, 7 0 


orororm ow fm | mat 


OP = 56, 57 ASLEU 


Description: 


18-22 


If the value of the SRCA operand is less than or equal to the value of 
the SRCB operand, instruction execution continues; otherwise, a trap 
with the specified vector number occurs. For the comparison, both 


operands are treated as unsigned integers. 


For programs in the User mode, a Protection Violation trap 


occurs—instead of the assert trap—if a vector number between O and 


63 is specified. 


Instruction Set 


ASLT 


Operation: 


Assembler 
Syntax: 


Status: 
Operands: 


31 


AMD ci 


ASLT 
Assert Less Than 
IF SRCA < SRCB THEN Continue 
ELSE Trap(VN) 
ASLT vn, ra, rb 
or 
ASLT vn, ra, const8 
Not affected 
SRCA Content of register RA 
SRCB M = 0: Content of register RB 
M = 1: | (Zero-extended to 32 bits) 
VN | Trap vector number 
23 | 15 7 | 0 


OP = 50, 51 ASLT 


Description: 


If the value of the SRCA operand is less than the value of the SRCB | 
operand, instruction execution continues; otherwise, a trap with the 
specified vector number occurs. 


For programs in the User mode, a Protection Violation trap 
occurs—instead of the assert trap—if a vector number between 0 and 
63 is specified. | 


Instruction Set 18-23 


zl AMD 
ASLTU 


Operation: 


Assembler 
Syntax: 


Status: 
Operands: 





31 93 15 7 0 


ASLTU 
Assert Less Than, Unsigned 


IF SRCA < SRCB (unsigned) THEN Continue 
ELSE Trap (VN) 


ASLTU vn, ra, rb 


ASLTU vn, ra, const8 
Not affected 
SRCA Content of register RA 
SRCB M = 0: Content of register RB 
~ Me= 1:1 (Zero-extended to 32 bits) 
VN Trap vector number 





OP = 52, 53 ASLTU 


Description: 


18-24 


If the value of the SRCA operand is less than the value of the SRCB 
operand, instruction execution continues; otherwise, a trap with the 
specified vector number occurs. For the comparison, both operands are 
treated as unsigned integers. | 


For programs in the User mode, a Protection Violation trap 
occurs—instead of the assert trap—if a vector number between 0 and 
63 is specified. 


instruction Set 


AMD | 


ASNEQ ASNEQ 
Assert Not Equal To 
Operation: IF SRCA <> SRCB THEN Continue 
ELSE Trap (VN) 
Assembler 
Syntax: ASNEQ vn, ra, rb 
or 
ASNEQ vn, ra, const8 
Status: Not affected 
Operands: SRCA Content of register RA 
SRCB M = 0: Content of register RB 
M = 1: | (Zero-extended to 32 bits) 
VN Trap vector number 
31 23 15 7 0 
011%1001M VN : RA Bor 
OP = 72, 73 | ASNEQ 
Description: _ If the SRCA operand is not equal to the SRCB operand, instruction 


execution continues; otherwise, a trap with the specified vector number 
occurs. 

For programs in the User mode, a Protection Violation trap 
occurs—instead of the assert trap—if a vector number between 0 and 
63 is specified. | 
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£4 amp 
| CALL 


Operation: 


Assembler | 


Syntax: 
Status: 
Operands: 


31 


CALL 
Call Subroutine 


DEST — PC //00+8 
PC — TARGET 
Execute delay instruction 


CALL ra, target 
Not affected 


TARGET A=0:117... 110/19 ... 12 (sign-extended to 30 bits) + PC 
A=1:117... 110/19 ... 12 (zero-extended to 30 bits) 


DEST Register RA 


23 15 7 * 0 


OP = A8, AQ ; CALL 


Description: 
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The address of the second following instruction is placed into the DEST 
location and a non-sequential instruction fetch occurs to the instruction 
address given by the TARGET operand. The instruction following the 
CALL is executed before the non-sequential fetch occurs. 


Instruction Set 


AMD Ll 


CALLI CALLI 
Call Subroutine, Indirect 


Operation: DEST <—PC//00+8 
PC — SRCB 
Execute delay instruction 


Assembler 
Syntax: CALLI ra, rb 


Status: Not affected 


Operands: SRCB Content of register RB 
DEST Register RA 
—«3t | 23 15 7 0 
ireeteoe ems | me | me 
OP=C8 CALLI | 


Description: The address of the second following instruction is placed into the DEST 
location and a non-sequential instruction fetch occurs to the instruction 
address given by the SRCB operand. The instruction following the 
CALLI is executed before the non-sequential fetch occurs. 
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zt AMD 


CLASS | | ? CLASS 
Classify Floating-Point Operand 


Operation: DEST <— CLASS(SRCA) 


Assembler 
Syntax: CLASS re, ra, FS 


Status: None 


Operands: SRCA Content of register RA (single-precision floating-point) 
or 
Content of register RA and the twin of register RA 
(double-precision floating-point) 


DEST Register RC 


Control: FS Format of source operand SRCA 
. 00 Reserved for future use 
01 Single-precision floating-point 
10 Double-precision floating-point 


11 Reserved for future use 
31 23 15 7 0 


OP = E6 CLASS 


Description: A 32-bit classification code for operand SRCA is placed into the DEST 
location. Operand SRCA is a single- or double-precision operand, as 
specified by FS. The classification code has the following format: 


31: 7 
O 


Bits 31-6: Reserved (forced to 0). 


Bit 5: Operand Sign (OS). The OS bit is 1 for a negative operand 
(including negative zero) and 0 for a non-negative operand. 


18-28 Instruction Set 


CLASS 


EFC © 


00000 
00001 
00010 
00011 


00100 
00101 
00110 
00111 


01000 
01001 
01010 
01011 


01100 
01101 
01110 


011117 | 


10000 
10001 
10010 
10011 


AMD cl 
CLASS 


Bits 4-0: Exponent-Fraction Class (EFC). This field classifies the 
biased exponent and fraction fields of the source operand as follows: 


Biased Exp (bexp) 
0 


0 
0 


1 
1 


1 <bexp < Max 


1 < bexp < Max 
1 < bexp < Max 


Max 


| Max 
Max 


Max + 1 


Max + 1, frac MSB = 0 
Max + 1, frac MSB = 1 


Fraction (frac) 
0 


O < frac < .111... 1 
.141...1 


111... 1 
0 


O<frac <.111... 1 
A eee 


0 


O < frac < .111... 1 


Bee 
0 


<>0 
<> 0 


Comments 


zero 
unused 

denormalized 
denormalized 


0 
unused | 
0 < frac <.111... 1 


unused 


unused 


infinity 
unused 
SNaN 
QNaN 


Note: Max is the largest biased exponent used to represent a finite number in a 
given format. Max is 254 for.single-precision and 2,046 for double-precision. 


This instruction is not supported directly in processor hardware. In the current 
implementation, this instruction causes a CLASS trap. When the trap occurs, 
the IPA and IPC registers are set to reference SRCA and DEST, and the IPB 


Register is set with the value of the FS field. 


Instruction Set 


18-29 


zt AMD ‘ 
CLZ 


Operation: 


Assembler 
Syntax: 


Status: 
Operands: 


31 


| CLZ 
Count Leading Zeros 
DEST < count of number of leading zeros in SRCB or | 
CLZ re, rb 
or 
CLZ rc, const8 
Not affected 
SRCB M = 0: Content of register RB 
M = 1: | (Zero-extended to 32 bits) 

DEST Register RC 

23 15 7 0 


Description: 


18-30 


OP = 08,09 CLZ 


A count of the number of zero-bits to the first one-bit in the SRCB 
operand is placed into the DEST location. If the most significant bit of 
the SRCB operand is 1, the resulting count is zero. If the SRCB operand 
is zero, the resulting count is 32. 


Instruction Set 


amp &l 


CONST | a CONST 
Constant 


Operation: DEST <— 0116 


Assembler 
Syntax: CONST ra, consti6 


Status: Not affected | 


Operands: 0116 H15...8//17... 10 (Zero-extended to 32 bits) 
DEST Register RA 
31 , 23 15 7 0 
OP = 03 CONST 


Description: The 0116 operand is placed into the DEST location. 


Note: To improve code readability, some assemblers mpernantt CONST 
to take a 32-bit argument (rather than const16). The lower half of the 
argument is constructed by the CONST. 
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Lt AMD 
CONSTH 


CONSTH 


Constant, High | 
Operation: Replace high-order half-word of SRCA by 116 
Assembler 
Syntax: CONSTH ra, const16 — 
Status: Not affected 
Operands: SRCA Content of register RA 
116 115... 18/17... 10 
DEST Register RA 
31 23 | 15 9) 
00000010 115... 18 RA 10 
OP = 02 CONSTH 
Description: The low-order half-word of the SRCA operand is appended to the 116 


18-32 


register. 


operand and the result is placed into the DEST operand. Note that the 
destination register for this instruction is the same as the source 


Note: To improve code readability, some assemblers implement 
CONSTH to take a 32-bit argument (rather than const16). The upper 


half of the argument is constructed by the CONSTH. 


Instruction Set 


AMD i\ 


CONSTN , - CONSTN 
Constant, Negative 


Operation: DEST <— 1116 


Assembler : 
Syntax: CONSTN ra, const16 


Status: Not affected 


Operands: 1116 115 ... 18 // 17... 10 (ones-extended to 32 bits) 
DEST Register RA 
31 23 15 7 7 0 
00000001 115 ...18 RA 17.10 
OP =01 CONSTN. 


Description: The 1116 operand is placed into the DEST location. 


Instruction Set. 18-33 


cl AMD 


18-34 


CONVERT 


Operation: 


Assembler 
Syntax: 


Status: 
Operands: 


Control: 


31 


CONVERT 
Convert Data Format 


DEST — SRCA, with format modified per Ul, RND, FD, FS 


CONVERT re, ra, Ul, RND, FD, FS 
{pX, fpU, fpV, fpR, foN 


SRCA ~ Content of register RA (single-precision floating-point) 
or 
Content of register RA and the twin of register RA 
(double-precision floating-point) 


DEST Content of register RC (single-precision floating-point) 
or | 
Content of register RC and the twin of register RA 
(double-precision floating-point) 


Ui O = signed integer 

1 = unsigned integer 
RND Round mode 
000 Round to nearest 
001 Round to minus infinity 
010 Round to plus infinity 
011 Round to zero 
100 Round using floating-point round mode (FRM) 
101-111 Reserved : 
FS,FD Format of source-operand, format of destination 

| operand 
00 Integer 
01 Single-precision floating-point 
10 Double-precision floating-point 
11 Reserved 
23 15 7 0 


OP = E4 CONVERT 


Description: 


The SRCA operand with format FS is converted to format FD and 
rounded according to RND, then placed into the DEST location. If the 
source or destination operand is an integer, it is a signed or poneigned 
value according to the value of UI. 


Note: Converting from format to like format is not supported and will 
produce unpredictable results. 


Instruction Set 


CONVERT 


AMD &\ 
CONVERT 


This instruction is not supported directly in processor hardware. In the 
current implementation this instruction causes a CONVERT trap. When 
the trap occurs, the IPA and IPC registers are set to reference SRCA 
and DEST, and the IPB Register is set with the value of the 
UI//RND//FD//FS field. If the UI! bit is 1, the contents of the IPB Register 
reflect the value of this field after Stack-Pointer addition. The Stack 
Pointer must be subtracted from the contents of the IPB Register to 
recover the original value of this field. 


Instruction Set 18-35 


£1 amp 
CPBYTE 


Operation: 


Assembler’ 
_ Syntax: 


Status: 
Operands: 


31 


0010%1%1%i1M 


OP = 2E, 2F 


Description: 


18-36 


CPBYTE 


Compare Bytes 


IF (SRCA.BYTEO = SRCB.BYTED) OR 
(SRCA.BYTE1 = SRCB.BYTE1) OR 
(SRCA.BYTE2 = SRCB.BYTE2) OR 
(SRCA.BYTE3 = SRCB.BYTES) THEN 
DEST — TRUE ELSE DEST < FALSE 


CPBYTE rc, ra, rb 
or 
CPBYTE rec, ra, const8 


Not affected 
SRCA 
SRCB 


Content of register RA 


M = 0: Content of register RB 
M = 1: I (Zero-extended to 32 bits) 


DEST Register RC 
23 15 7 0 


RB or | 
CPBYTE 


Each byte of the SRCA operand is compared to the corresponding byte 
of the SRCB operand. If any corresponding bytes are equal, a Boolean 
TRUE is placed into the DEST location; otherwise, a Boolean FALSE is 
placed into the DEST location. 


w 


Instruction Set 


amp &A 


CPEQ CPEQ 
Compare Equal To 
Operation: IF SRCA=SRCB THEN DEST < TRUE 
ELSE DEST < FALSE 
Assembler 
Syntax: CPEQ ce, ra, rb — 
or 
CPEQ rc, ra, const8 
Status: Not affected 
Operands: SRCA Content of register RA 
SRCB M = 0: Content of register RB 
M = 1: | (Zero-extended to 32 bits) 
DEST Register RC 
31 23 15 7 0 
0110000M RC RA 7 | RB or | 
OP = 60, 61 CPEQ 
Description: If the SRCA operand is equal to the SRCB operand, a Boolean TRUE is 


‘placed into the DEST location; otherwise, a Boolean FALSE is peer 
into the DEST location. | 


Instruction Set 18-37 


21 amp 


CPGE | CPGE 
Compare Greater Than or Equal To 


Operation: IF SRCA>SRCB THEN DEST < TRUE 
ELSE DEST <— FALSE 


Assembler 
Syntax: CPGE rc, ra, rb 
or 
CPGE re, ra, const8 


Status: Not affected 


Operands: SRCA Content of register RA 
SRCB M = 0: Content of register RB 
M = 1: | (Zero-extended to 32 bits) 
DEST Register RC 
31 23 “45 | 7 | 0 


OP = 4C, 4D | CPGE 


Description: _ If the value of the SRCA operand is greater than or equal to the value of 
the SRCB operand, a Boolean TRUE is placed into the DEST location; 
otherwise, a Boolean FALSE is placed into the DEST location. 


18-38 . | Instruction Set 


CPGEU 


Operation: 


Assembler 
Syntax: 
Status: 


Operands: 


31 


AMD br 
CPGEU 


_ Compare Greater Than or Equal To, Unsigned 


IF SRCA > SRCB (unsigned) THEN DEST <— TRUE 
ELSE DEST < FALSE 


CPGEU re, ra, rb 
or | 
CPGEU rec, ra, const8 


Not affected | | | 
SRCA Content of register RA 


SRCB M = 0: Content of register RB 
M = 1: | (Zero-extended to 32 bits) 


DEST Register RC 


23 15 7 | 0 


OP = 4E, 4F CPGEU 


Description: 


If the value of the SRCA operand is greater than or equal to the value of 
the SRCB operand, a Boolean TRUE is placed into the DEST location; 


otherwise, a Boolean FALSE is placed into the DEST location. For the 


comparison, both operands are treated as unsigned integers. 
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al AMD 
CPGT 


Operation: 


Assembler 
Syntax: 
Status: 


Operands: 


31 


CPGT 
Compare Greater Than 


IF SRGA > SRCB THEN DEST <— TRUE 
ELSE DEST «+ FALSE 


CPGT re, ra, rb 
or 


_ CPGT re, ra, const8 


Not affected 
SRCA Content of register RA 


SRCB M=0: Content of register RB. 
M = 1: | (Zero-extended to 32 bits) 


DEST Register RC 


23 15 7 -0 


OP = 48, 49 | CPGT 


Description: 


18-40 


If the value of the SRCA operand is greater than the value of the SRCB 
operand, a Boolean TRUE is placed into the DEST location; otherwise, 
a Boolean FALSE is placed into the DEST location. 


Instruction Set 


AMD cl 





CPGTU CPGTU 
Compare Greater Than, Unsigned 


Operation: IF SRCA > SRCB (unsigned) THEN DEST « TRUE 
ELSE DEST < FALSE 


Assembler _ 
Syntax: CPGTU rec, ra, rb 
| or 
CPGTU re, ra, const8 | 


Status: Not affected 


Operands: SRCA Content of register RA 
SRCB ~ M=0: Content of register RB 
M = 1: | (Zero-extended to 32 bits) 
DEST Register RC 
31 23 15 7 0 
| 
OP = 4A, 4B CPGTU | | 


Description: If the value of the SRCA operand is greater than the value of the SRCB 
oo operand, a Boolean TRUE is placed into the DEST location; otherwise, 
a Boolean FALSE is placed into the DEST location. For the comparison, 
both operands are treated as unsigned integers. 


Instruction Set | 18-41 


21 amp 
| CPLE 


Operation: 


Assembler 
Syntax: 


Status: 
Operands: 





31 23 15 7 ) 


| CPLE 
Compare Less Than or Equal To | 


IF SRCA < SRCB THEN DEST < TRUE 
ELSE DEST < FALSE 3 


CPLE re, ra, rb 


or 
CPLE re, ra, const8 
Not affected 
SRCA Content of register RA 
SRCB . M = 0: Content of register RB 
M = 1: | (Zero-extended to 32 bits) 
DEST Register RC 





OP = 44, 45 CPLE 


Description: 


18-42 


If the value of the SRCA operand is less than or equal to the value of 
the SRCB operand, a Boolean TRUE is placed into the DEST location; 
otherwise, a Boolean FALSE is placed into the DEST location. 


Instruction Set 


- CPLEU 


AMD c\ 


CPLEU 
Compare Less Than or Equal To, Unsigned 
Operation: IF SRCA <SRCB (unsigned) THEN DEST ~— TRUE 
ELSE DEST < FALSE » 
Assembler | 
Syntax: CPLEU rc, ra, rb 
or 
CPLEU re, ra, const8 
Status: Not affected | 
Operands: SRCA Content of register RA 
SRCB M = 0: Content of register RB 
M = 1: I (Zero-extended to 32 bits) 
DEST Register RC 
31 23 15 7 | fe) 
01000%1%1M RC RA RB or | 
OP = 46, 47 CPLEU 
Description: lf the value of the SRCA operand is less than or equal to the value of 


the SRCB operand, a Boolean TRUE is placed into the DEST location; 
otherwise, a Boolean FALSE is placed into the DEST location. For the 
comparison, both operands are treated as unsigned integers. 


Instruction Set | 1 8-43 


£4 amp 
CPLT CPLT 
Compare Less Than 


Operation: IF SRCA<SRCB THEN DEST — TRUE 
ELSE DEST «+ FALSE 


Assembler 
Syntax: CPLT rc, ra, rb 
| or 
CPLT re, ra, const8 


Status: Not affected | | 
Operands: SRCA Content of register RA 


SRCB M = 0: Content of register RB 
M = 1: | (Zero-extended to 32 bits) 
DEST Register RC 
31 23 15 7. 0 
OP = 40, 41 3 — CPLT 


Description: _ If the value of the SRCA operand is less than the value of the SRCB 
operand, a Boolean TRUE is placed into the DEST location; otherwise, 
a Boolean FALSE is placed into the DEST location. 


18-44 | instruction Set 


CPLTU 


Operation: 


Assembler 
Syntax: 


Status: 
Operands: 


31 


AMD cl 


| CPLTU 
Compare Less Than, Unsigned 


IF SRCA < SRCB (unsigned) THEN DEST < TRUE 
ELSE DEST «+ FALSE 


CPLTU re, ra, rb 
or | 
CPLTU re, ra, const8 


Not affected 
SRCA Content of register RA 


SRCB M = 0: Content of register RB 
M = 1: | (Zero-extended to 32 bits) 


DEST Register RC 


23 15 7 0 


OP = 42, 43 CPLTU 


‘Description: 


If the value of the SRCA operand is less than the value of the SRCB 
operand, a Boolean TRUE is placed into the DEST location; otherwise, 
a Boolean FALSE is placed into the DEST location. For the comparison, 
both operands are treated as unsigned integers. 


Instruction Set 18-45 


£1 amp 
| CPNEQ 


Operation: | 


Assembler 
Syntax: 


Status: 
Operands: 


31: 


CPNEQ 
Compare Not Equal To 


IF SRCA <> SRCB THEN DEST < TRUE 
ELSE DEST <— FALSE — 


CPNEQ te, ra, rb 
or. 
CPNEQ re, ra, const8 


Not affected 
SRCA Content of register RA 


SRCB M = 0: Content of register RB 
M = 1: | (Zero-extended to 32 bits) 


DEST Register RC 


23 45 | 7 0 


OP =62,63 CPNEQ 


Description: 


18-46 


If the SRCA operand is not equal to the SRCB operand, a Boolean | 
TRUE is placed into the DEST location; otherwise, a Boolean FALSE is 
placed into the DEST location. 


Instruction Set 


DADD 


Operation: 


Assembler 


Syntax: 
Status: 
Operands: 


31 


AMD & 


DADD 
Floating-Point Add, Double-Precision 


DEST (double- preceod) «<- SRCA (double- nicision) + 


SRCB (double-precision) 


DADD re, ra, rb 

fpX, fpU, fpV, fpR, foN 

SRCA Content of register RA and the twin of register RA 
SRCB Content of register RB and the twin of register RB 
DEST Register RC and the twin of register RC 


23 15 7 0 


OP =F1 DADD 


Description: 


The SRCA operand is added to the SRCB operand; the result is 
rounded according to the FRM field of the Floating-Point Environment 
Register and placed into the DEST location. The operands and the 
result of the addition are double-precision floating-point numbers. 


Note: This instruction is not supported directly in processor hardware. In 
the current implementation this instruction causes a DADD trap. When 
the trap occurs, the IPA, IPB, and IPC registers are set to reference 
SRCA, SRCB, and DEST. 


Instruction Set 18-47 


cl AMD 
DDIV 
Operation: 
Assembler 
Syntax: 


Status: 
Operands: 


31 


DDIV 
Floating-Point Divide, Double-Precision 

DEST (double-precision) — SRCA (double-precision) / 
| SRCB (double-precision) 
DDIV re, ra, rb 
fpD, fpX, fpU, fpV, fpR, foN 
SRCA Content of register RA and the twin of register RA 
SRCB Content of register RB and the twin of register RB 
DEST Register RC and the twin of register RC 

23 15 7 0 


OP = F7 | DDIV 


Description: 


18-48 


The SRCA operand is divided by the SRCB operand; the result is 
rounded according to the FRM field of the Floating-Point Environment 
Register and placed into the DEST location. The operands and the 
result of the division are double-precision floating-point numbers. 


Note: This instruction is not supported directly in processor hardware. In 
the current implementation this instruction causes a DDIV trap. When 
the trap occurs, the IPA, IPB, and IPC registers are set to reference 
SRCA, SRCB, and DEST. 


Instruction Set 


DEQ 


Operation: 


Assembler 
Syntax: 


Status: 
Operands: 


31 


AMD &\ | 


DEQ 
Floating-Point Equal To, Double-Precision 

IF SRCA (double-precision) = SRCB (double-precision) 
THEN DEST <— TRUE 
ELSE DEST < FALSE 
DEQ re, ra, rb 
fpl 
SRCA Content of register RA and the twin of register RA 
SRCB Content of register RB and the twin of register RB 
DEST Register RC 

23 45 7 0 


OP = EB DEQ 


Description: 


If the SRCA operand is equal to the SRCB operand, a Boolean TRUE is 
placed into the DEST location; otherwise, a Boolean FALSE is placed 
into the DEST location. SRCA and SRCB are double-precision 
floating-point numbers. 


The rounding mode specified by the FRM field of the Floating-Point 
Environment Register has no effect on this operation. 


Note: This instruction is not supported directly in processor hardware. In 
the current implementation this instruction causes a DEQ trap. When 
the trap occurs, the IPA, IPB, and IPC registers are set to reference 
SRCA, SRCB, and DEST. 


Instruction Set - 18-49 


4A amp 
DGE | DGE 


Floating-Point Greater Than Or Equal To, Double-Precision 
Operation: IF SRCA (double-precision) > SRCB (double-precision) 


THEN DEST <— TRUE 
ELSE DEST < FALSE 


Assembler 
Syntax: DGE re, ra, rb 
Status:  fpl 
Operands: SRCA Content of register RA and the twin of register RA 
SRCB Content of register RB and the twin of register RB 
DEST Register RC 
3 23 15 7 0 
vores} ome | om fe 
OP = EF DGE 


Description: _ If the SRCA operand is greater than or equal to the SRCB operand, a 
| _Boolean TRUE is placed into the DEST location; otherwise, a Boolean 
FALSE is placed into the DEST location. SRCA and SRCB are . 
double-precision floating-point numbers. 


The rounding mode specified by the FRM field of the Floating-Point 
Environment Register has no effect on this operation. 


Note: This instruction is not supported directly in processor hardware. In 
the current implementation this instruction causes a DGE trap. When 

_the trap occurs, the IPA, IPB, and IPC registers are set to reference 
SRCA, SRCB, and DEST. 


18-50 Instruction Set 


DGT 


Operation: 


Assembler 
Syntax: 


Status: 
Operands: 


31 


AMD ct 


- DGT 
Floating-Point Greater Than, Double-Precision 
IF SRCA (double-precision) > SRCB (double-precision) 
THEN DEST < TRUE 
ELSE DEST + FALSE 

DGT re, ra, rb 
fpl 
SRCA Content of register RA and the twin of register RA 

- SRCB Content of register RB and the twin of register RB 
DEST | Register RC 

23 15 7 0 


OP =ED | DGT 


Description: 


lf the SRCA operand is greater than the SRCB operand, a Boolean 
TRUE is placed into the DEST location; otherwise, a Boolean FALSE is 
placed into the DEST location. SRCA and SRCB are double- -precision 
floating-point numbers. 


The rounding mode specified by the FRM field of the Floating-Point 
Environment Register has no effect on this operation. 


Note: This instruction is not supported directly in processor hardware. In 
the current implementation this instruction causes a DGT trap. When the 


_ trap occurs, the IPA, IPB, and IPC registers are set to reference SRCA, 


SRCB, and DEST. 
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ci AMD 
DIV 


Operation: 


Assembler 
Syntax: 


Status: 
Operands: 


31 


| DIV 
Divide Step 
Perform one-bit step of a divide operation (unsigned) 
DIV re, ra, rb 
or 
DIV Ie, ra, const 8 
V, N, Z, C 
SRCA Content of register RA 
SRCB M=0: Content of register RB 
: M=1: | (Zero-extended to 32 bits) 
DEST Register RC 
23 | 15 7 0 


OP = 6A, 6B | DIV 


Description: 


18-52 


If the Divide Flag (DF) bit of the ALU Status Register is 1, the SRCB 
operand is subtracted from the SRCA operand. If the DF bit is 0, the 
SRCB operand is added to the SRCA operand. 


The carry-out of the add or subtract operation is exclusive-ORed with 
the value of the DF bit and the value of the Negative (N) bit of the ALU 


Status Register; the resulting value is complemented and placed into 


the DF bit. The sign of the result of the add or subtract is placed into the 
N bit. | : 

The content of the Q Register is appended to the result of the add or 
subtract, and the resulting 64-bit value is shifted left by one bit position; 
the value computed for the DF bit above fills the vacated bit position. 
The high-order 32 bits of the 64-bit shifted value are placed into the 
DEST location. The low-order 32 bits of the shifted value are placed into 
the Q Register. 


Examples of integer divide operations appear in Section 2.6.3. 


Instruction Set 


DIVO 


Operation: | 


| Assembler 
Syntax: 


Status: 
Operands: 


31 


AMD ct 


DIVO 
Divide Initialize 
Initialize for a sequence of divide steps (unsigned) 
DIVO re, rb 
or 
DIVO rc, const8 
V,N, Z, © 
SRCB M=0: Content of register RB 
M=1: {| (Zero-extended to 32 bits) 
DEST Register RC 
23 15 | 7 0 


OP = 68, 69 | | DIVO 


Description: 


The Divide Flag (DF) bit of the ALU Status Register is set. The sign of 
the SRCB operand is placed into the Negative bit of the ALU Status 
Register. | 


The content of the Q register is appended to the SRCB operand, and 
the resulting 64-bit value is shifted left by one bit position; a 0 fills the 
vacated bit position. The high-order 32 bits of the 64-bit shifted value 
are placed into the DEST location. The low-order 32 bits of the shifted 
value are placed into the Q Register. | 


Examples of integer divide operations appear in Section 2.6.3. 


Instruction Set 18-53 


a4 amo 


DIVIDE 


Operation: 


- Assembler 
Syntax: 


Status: 
Operands: 


31 


1110000 1 RC RA 


OP =E1 


Description: 


18-54 


| DIVIDE 
Integer Divide, Signed 
DEST <— (Q // SRCA) / SRCB (signed) 
Q — Remainder 
DIVIDE re, ra, rb 
Not affected | 
Q Content of the Q Register 
SRCA Content of register RA 
SRCB Content of register RB 
DEST Register RC 
23 15 | 7 0 


DIVIDE 


The SRCA operand is appended to the content of the Q register. The 
resulting 64-bit value is divided by the SRCB operand and the result is 
placed into the DEST location. This operation treats the operands as 
signed two’s-complement integers and produces a signed 
two’s-complement result. 


The remainder is placed into the Q register. A non-zero remainder 
always has the same sign as the dividend. 


Note: This instruction is not supported directly in processor hardware. In 
the current implementation this instruction causes a DIVIDE trap. When 
the trap occurs, the IPA, IPB, and IPC registers are set to reference 
SRCA, SRCB, and DEST. 7 


Instruction Set 


AMD cl 


DIVIDU DIVIDU 
Integer Divide, Unsigned 
Operation: DEST <— (Q//SRCA) /SRCB (unsigned) 
Q < Remainder 
Assembler 
Syntax: DIVIDU re, ra, rb 
- Status: Not affected 
Operands: Q Content of the Q Register 
SRCA - Content of register RA 
SRCB Content of register RB 
DEST Register RC 
31 23 15 7 0 
11100011 RC : RA RB 
OP =E3 : DIVIDU | 
Description: . The SRCA operand is appended to the content of the Q Register. The 


resulting 64-bit value is divided by the SRCB operand and the result is 
placed into the DEST location. This operation treats the operands as 
unsigned integers, and produces an unsigned result. 


The remainder is placed into the Q Register. The remainder is also 
unsigned. 


Note: This instruction is not supported directly in processor hardware. In | 
the current implementation this instruction causes a DIVIDU trap. When 
the trap occurs, the IPA, IPB, and IPC registers are set to reference 
SRCA, SRCB, and DEST. 


Instruction Set 18-55 


cl AMD 


18-56 


DIVL 


Operation: 


Assembler 
Syntax: 


Status: 
Operands: 


31 


| DIVL 
Divide Last Step 
Complete a sequence of divide steps (unsigned) 
DIVL rc, ra, rb 
V, N, Z, C 
SRCA Content of register RA 
SRCB M=0: Content of register RB 
M=1: | (Zero-extended to 32 bits) 

DEST Register RC 

23 15 7 0 


OP = 6C, 6D DIVL 


Description: 


If the Divide Flag (DF) bit.of the ALU Status Register is 1, the SRCB 
operand is subtracted from the SRCA operand. If the DF bit is 0, | 

the SRCB operand is added to the SRCA operand. The result is placed 
into the DEST location. 


The carry-out of the add or subtract operation is exclusive-ORed with 
the value of the DF bit and the value of the Negative (N) bit of the ALU 
Status Register; the resulting value is complemented and placed into 
the DF bit. The sign of the result of the add or subtract is placed into the 
N bit. 


The content of the Q register is shifted left by one bit position; the value 
computed for the DF bit above fills the vacated bit position. The shifted . 
value is placed into the Q Register. 


Examples of integer divide operations appear in Section 2.6.3. 


instruction Set 


AMD &A 


DIVREM _ DIVREM 
Divide Remainder 


Operation: Generate remainder for divide operation (unsigned) 


Assembler 
Syntax: DIVREM ‘rc, ra, rb 
or 
DIVREM rec, ra, const8 


Status: V,N,Z,C 


Operands: SRCA | Content of register RA 
SRCB M=0: Content of register RB 
a M=1: | (Zero-extended to 32 bits) 
DEST Register RC 
31 23 15 7 0 
OP =6E, 6F | DIVREM | 


Description: If the Divide Flag (DF) bit of the ALU Status Register is 1, the SRCA 
operand is placed into the DEST location. 


If the DF bit is 0, the SRCB operand is added to the SRCA operand and 
the result is placed into the DEST location. 


Examples of integer divide operations appear in Section 2.6.3. 


Instruction Set 18-57 


41 amp 
DMUL 


Operation: 


Assembler 
Syntax: 


Status: 
Operands: 


31 


| DMUL 
Floating-Point Multiply, Double-Precision 
DEST (double-precision) — SRCA (double-precision) * 
SRCB (double-precision) 
DMUL re, ra, rb 
fpX, fpU, fpV, fpR, foN 
SRCA Content of register RA and the twin of register RA 
SRCB Content of register RB and the twin of register RB 
DEST Register RC 
23 15 7 0 


OP = F5 | DMUL 


Description: 


18-58 


The SRCB operand is multiplied by the SRCA operand; the result is 
rounded according to the FRM field of the Floating-Point Environment 
Register and is placed into the DEST location. The operands and the © 
result of the multiplication are double-precision floating-point numbers. 


Note: This instruction is not supported directly in processor hardware. In 
the current implementation this instruction causes a DMUL trap. When 
the trap occurs, the IPA, IPB, and IPC registers are set to reference 
SRCA, SRCB, and DEST. 


Instruction Set 


DSUB 


Operation: 


- Assembler | 


Syntax: 
Status: 
Operands: 


31 


AMD cl 
DSUB 


Floating-Point Subtract, Double-Precision 


DEST (double-precision) <— SRCA (double-precision) _ 
SRCB (double-precision) 


DSUB re, ra, rb 

fpX, fpU, fpV, fpR, fpN 

SRCA _ Content of register RA and the twin of register RA 
SRCB Content of register RB and the twin of register RB 
DEST Register RC 


23 15 7 0 


OP = F3 | DSUB 


Description: 


The SRCB operand is subtracted from the SRCA operand; the result is 
rounded according to the FRM field of the Floating-Point Environment 
Register and is placed into the DEST location. The operands and the 
result of the subtraction are double-precision floating-point numbers. 


Note: This instruction is not supported directly in processor hardware. In 
the current implementation this instruction causes a DSUB trap. When 


‘the trap occurs, the IPA, IPB, and IPC registers are set to reference 


SRCA, SRCB, and DEST. 


Instruction Set 18-59 


o1 amo 


EMULATE ~ EMULATE 
Trap to Software Emulation Routine 
Operation: Load IPA and IPB registers with operand register numbers 
and Trap (VN) : 


Assembler 
Syntax: EMULATE vn, ra, rb 


.Status: Not affected 
Operands: Absolute-register numbers for registers RA and RB 
VN — Trap vector number 


31 23 15 7 0 
OP = D7 | EMULATE © | 


Description: The IPA and IPB registers are set to the register numbers of registers 
RA and RB, respectively. A trap with the specified vector number 
Occurs. | 


Note that the IPC register is also affected by this instruction, but its 
value has no interpretation. 


For programs in the User mode, a Protection Violation trap occurs— 

instead of the EMULATE trap—if a vector number between 0 and 693 is 
specified. A Protection Violation trap also occurs if RA or RB specifies a - 
register protected by the Register Bank Protect Register. 


18-60 | Instruction Set 


AMD cl 


EXBYTE EXBYTE 
Extract Byte 
Operation: DEST < SRCB, with low-order byte replaced by byte in 
SRCA selected by BP 
Assembler 
Syntax: EXBYTE rc, ra, rb 
or 
EXBYTE re, ra, const8 
Status: Not affected 
Operands: SRCA Content of register RA 
| SRCB M=0: Content of register RB 
M=1: | (Zero-extended to 32 bits) 
DEST Register RC 
31 23 | 15 7 0 
0000%101M RC RA ~ RBorl 
OP = 0A, 0B . EXBYTE 
Description: A byte in the SRCA operand is selected by the Byte Pointer (BP) field of 


the ALU Status Register. The selected byte replaces the low-order byte 
of the SRCB operand and the resulting word is placed.into the DEST 
location. 


Note: The selection of bytes within words is specified in 
Section 3.3.5.1. 


instruction Set 18-61 


£1 amo 


EXHW | - EXHW 
Extract Half-Word 
Operation: DEST < SRCB, with low-order half-word replaced by half-word in 
SRCA selected by BP 
Assembler 
Syntax: EXHW rc, ra, rb 


or 
EXHW rec, ra, const8 


Status: Not affected 


Operands: SRCA Content of register RA 
SRCB M = 0: Content of register RB 
M=1: | (Zero-extended to 32 bits) 
DEST Register RC 
31 23 15 7 0 
OP =7C, 7D EXHW 


Description: A half-word in the SRCA operand is selected by the Byte Pointer (BP) 
field of the ALU Status Register. The selected half-word replaces the 
low-order half-word of the SRCB operand and the resulting word is 
placed into the DEST location. 


Note: The selection of half-words within words is specified in 
Section 3.3.5.1. 


18-62 Instruction Set 


AMD &A 
EXHWS EXHWS 
Extract Half-Word, Sign-Extended | 
Operation: DEST < half-word in SRCA selected by BP, 


sign-extended to 32 bits 


Assembler | | 
Syntax: EXHWS rc, ra 


Status: Not affected 


Operands: SRCA Content of register RA 
DEST Register RC 
31 23 15 7 0 
OP=7E _EXHWS 


Description: A half-word in the SRCA operand is selected by the Byte Pointer (BP) 
field of the ALU Status Register. The selected half-word is 
sign-extended to 32 bits and the resulting word is placed into the DEST 
location. , 


Note: The selection of half-words within words is specified in 
Section 3.3.5.1. | 


Instruction Set | 18-63 


£1 amp 
EXTRACT 


Operation: 


Assembler 
Syntax: 


Status: 
Operands: 


31 


, EXTRACT 
Extract Word, Bit-Aligned 


DEST < high-order word of (SRCA // SRCB << FC) 


EXTRACT re, ra ,rb 
or 
EXTRACT re, ra, const8 


Not affected 
SRCA Content of register RA 


SRCB M =0: Content of register RB 
_ Me=1: 1 (Zero-extended to 32 bits) 


DEST | Register RC 


23 | 15 7 0 


OP = 7A, 7B EXTRACT 


Description: 


18-64 


The SRCB operand is appended to the SRCA operand and the resulting 
64-bit value is shifted left by the number of bit-positions specified by the 

Funnel Shift Count (FC) field of the ALU Status register. The high-order 

32 bits of the 64-bit shifted value are placed in the DEST location. 


If the SRCB operand is the same as the SRCA operand, the EXTRACT 


- instruction performs a rotate operation. 


Instruction Set 


AMD de 


FADD FADD 
Floating-Point Add, Single-Precision 
Operation: DEST (single-precision) — SRCA (single-precision) + 
SRCB (single-precision) 
Assembler 
Syntax: FADD rc, ra, rb . 
Status: fpX, fpU, fpV, fpR, foN 
Operands: SRCA Content of register RA 
| SRCB Content of register RB 
DEST Register RC 
31 23 15 7 0 


OP = FO FADD 


Description: 


The SRCA operand is added to the SRCB operand; the result is 
rounded according to the FRM field of the Floating-Point Environment 
Register and placed into the DEST location. The operands and the 


_ result of the addition are single-precision floating-point numbers. 
Note: This instruction is not supported directly in processor hardware. In 


the current implementation this instruction causes an FADD trap. When 
the trap occurs, the IPA, IPB, and IPC registers are set to reference 
SRCA, SRCB, and DEST. 


Instruction Set 18-65 — 


o1 amo 


FDIV 


Operation: 


Assembler 
Syntax: 


Status: 
Operands: 


31 


FDIV 
Floating-Point Divide, Single-Precision 


DEST ingle: precision) — SRCA (single-precision) / 
SRCB (single-precision) 


FDIV re, ra, rb 

fpD, fpX, fpU, fpV, fpR, fpN | 
SRCA Content of register RA 
SRCB Content of register RB 
DEST Register RC 


23 15 7 0 


OP = F6 FDIV 


Description: 


18-66 


The SRCA operand is divided by the SRCB operand; the result is 
rounded according to the FRM field of the Floating-Point Environment 
Register and placed into the DEST location. The operands and the 
result of the division are single-precision floating-point numbers. 


Note: This instruction is not supported directly in processor hardware. In 
the current implementation this instruction causes an FDIV trap. When 
the trap occurs, the IPA, IPB, and IPC registers are set to reference 
SRCA, SRCB, and DEST. | 


Instruction Set 


FDMUL 


Operation: 


Assembler 
Syntax: 


Status: 
Operands: 


31 


AMD a 


FDMUL 
Floating-Point Multiply, Single-to-Double Precision 
DEST (double-precision) — SRCA (single-precision) * 
| SRCB (single-precision) 

FDMUL re, ra, rb 
fpR, fpN 
SRCA Content of register RA 
SRCB Content of register RB 
DEST Register RC 

23 15 7 O 


OP = F9 FDMUL 


Description: 


The SRCB operand is multiplied by the SRCA operand; the result is 
placed into the DEST location. SRCA and SRCB are single-precision 
floating-point numbers; the result is produced in double-precision 
format. Because the product of two single-precision operands can 
always be represented exactly as a double-precision number, the 
FDMUL result does not depend on the FRM field of the Floating-Point 
Environment Register. 


Note: This instruction is not supported directly in processor hardware. In 
the current implementation this instruction causes an FDMUL trap. 
When the trap occurs, the IPA, IPB, and IPC registers are set to 
reference SRCA, SRCB, and DEST. 


Instruction Set 18-67 


£4 amp 
_ -FEQ | | FEQ 
Floating-Point Equal To, Single-Precision 
Operation: IF SRCA (single-precision) = SRCB (single-precision) 


THEN DEST < TRUE 
ELSE DEST <— FALSE 


Assembler 
Syntax: FEQ re, ra, rb 


Status: fpN 


~ Operands: SRCA Content of register RA 
SRCB Content of register RB 
DEST Register RC 
31 23 15 7 ) 
evororo] me | om] me 
OP=EA FEQ 


Description: _ If the SRCA operand is equal to the SRCB operand, a Boolean TRUE is 
placed into the DEST location; otherwise, a Boolean FALSE is placed 
into the DEST location. SRCA and SRCB are single-precision 
floating-point numbers. 


The rounding mode specified by the FRM field of the Floating-Point 
_ Environment Register has no effect on this operation. 


Note: This instruction is not supported directly in processor hardware. In 
the current implementation this instruction causes an FEQ trap. When 
the trap occurs, the IPA, IPB, and IPC registers are set to reference 
SRCA, SRCB, and DEST. , 


18-68 | | Instruction Set 


AMD Ll 


FGE | FGE 
Floating-Point Greater Than Or Equal To, Single-Precision 
Operation: iF SRCA (single-precision) > SRCB (single-precision) 
THEN DEST <— TRUE 
ELSE DEST < FALSE 
Assembler 
Syntax: FGE rc, ra, rb 
Status: fpN | 
Operands: SRCA Content of register RA 
| SRCB Content of register RB 
DEST Register RC 
31 23 15 : 7 0 


OP = EE FGE 


Description: 


If the SRCA operand is greater than or equal to the SRCB operand, a 
Boolean TRUE is placed into the DEST location; otherwise, a Boolean 
FALSE is placed into the DEST location. SRCA and SRCB are 
single-precision floating-point numbers. 


The rounding mode specified by the FRM field of the Floating-Point 


Environment Register has no effect on this operation. 


Note: This instruction is not supported directly in processor hardware. In 
the current implementation this instruction causes an FGE trap. When 
the trap occurs, the IPA, IPB, and IPC registers are set to reference 
SRCA, SRCB, and DEST. 7 


Instruction Set | 18-69 


oN amo 


FGT | FGT 
Floating-Point Greater Than, Single-Precision 
Operation: _IF SRCA (single-precision) > SRCB (single-precision) 


THEN DEST <— TRUE 
ELSE DEST < FALSE 


Assembler | 
Syntax: FGT rc, ra, rb 
Status: fpN 

Operands: — SRCA Content of register RA 
SRCB Content of register RB 
DEST Register RC 

31 23 15 7 0 

fevortoo ome] om fm 


OP =EC FGT 


Description: _ If the SRCA operand is greater than the SRCB operand, a Boolean 
TRUE is placed into the DEST location; otherwise, a Boolean FALSE is 
. placed into the DEST location. SRCA and SRCEB are single-precision 
floating-point numbers. 


The rounding mode specified by the FRM field of the Floating-Point 
Environment Register has no effect on this operation. 


Note: This instruction is not supported directly in processor hardware. In 
the current implementation this instruction causes an FGT trap. When 
the trap occurs, the IPA, IPB, and IPC registers are set to reference 
SRCA, SRCB, and DEST. 


18-70 Instruction Set 


FMUL 


Operation: 


Assembler 
Syntax: 


Status: 
Operands: 


31 


AMD él 


FMUL 
Floating-Point Multiply, Single-Precision 
_ DEST (single-precision) <— SRCA (single-precision) c 
SRCB (single-precision) 
FMUL re, ra, rb 
fpX, fpU, fpV, fpR, fpN 
SRCA Content of register RA 
SRCB Content of register RB 
DEST Register RC 
23 15 7 0 


OP = F4 | | FMUL 


Description: 


The SRCA operand is multiplied by the SRCB operand; the result is 
rounded according to the FRM field of the Floating-Point Environment 
Register and placed into the DEST location. The operands and the 
result of the multiplication are single-precision floating-point numbers. 


Note: This instruction is not supported directly in processor hardware. In 
the current implementation this instruction causes an FMUL trap. When 
the trap occurs, the IPA, IPB, and IPC registers are set to reference 
SRCA, SRCB, and DEST. 7 


Instruction Set 3 18-71 


iT amp 
FSUB 


Operation: 


Assembler 
Syntax: 


Status: 
Operands: 


31 


| FSUB 
Floating-Point Subtract, Single-Precision 
DEST (single-precision) — SRCA (single-precision) — 
SRCB (single-precision) 
FSUB re, ra, rb 
fpX, fpU, fpV, fpR, foN 
SRCA Content of register RA 
SRCB Content of register RB 
DEST Register RC 
23 15 7 0 


OP = F2 FSUB 


Description: 


18-72 


The SRCB operand is subtracted from the SRCA operand; the result is 
rounded according to the FRM field of the Floating-Point Environment 
Register and placed into the DEST location. The operands and the 
result of the subtraction are single-precision floating-point numbers. 


Note: This instruction is not supported directly in processor hardware. In 
the current implementation, this instruction causes an FSUB trap. When 
the trap occurs, the IPA, IPB, and IPC registers are set to reference 
SRCA, SRCB, and DEST. 


Instruction Set 


AMD &\ 


HALT , 7 , HALT 
Enter Halt Mode ) 


Operation: Enter Halt mode on next cycle 


Assembler 
Syntax: HALT 


Status: Not affected 
Operands: Not applicable 


31 23 15 7 0 


10001001 _ Reserved , Reserved Reserved 


OP = 89 | HALT 


Description: The processor is placed into the Halt mode in the next cycle, or in the 
cycle after an external data access is completed if an access is in 
progress. | 


This instruction may be executed only by Supervisor-mode programs. 
An attempted execution by a User-mode program causes a Protection 
Violation trap to occur unless the Protection Violation trap was disabled 
during reset (See Section 17.7.5). 


If the instruction following a Halt instruction has an exception 
(e.g., TLB Miss), the trap associated with this exception is taken before 
_the processor enters the Halt mode. 


' Instruction Set 18-73 


1 amo 


INBYTE 


Operation: 


Assembler 


Syntax: 


Status: 


Operands: 


31 


INBYTE 
Insert Byte 


DEST <-SRCA, with byte selected by BP 
replaced by low-order byte of SRCB 


INBYTE re, ra, rb 
or 
INBYTE re, ra, const8 


Not affected | 
SRCA Content of register RA 


SRCB M=0: Content of registerRB 
M = 1: | (Zero-extended to 32 bits) 


DEST Register RC 


23 . 15 7 0 


OP =0C, 0D INBYTE 


Description: 


18-74 


A byte in the SRCA operand is selected by the Byte Pointer (BP) field of 
the ALU Status Register. The selected byte is replaced by the low-order 
byte of the SRCB operand and the resulting word is aati into the 
DEST location. 


Note: The selection of bytes within words is specified in 
Section 3.3.5.1. 


instruction Set 


INHW 


Operation: 


Assembler 
Syntax: 


Status: 
Operands: 


AMD &\ | 


INHW 
Insert Half-Word 


DEST <-SRCA, with half-word selected by BP replaced by 


low-order half-word of SRCB 

INHW re, ra, rb 

or | 
INHW rc, ra, const8 
Not affected 
SRCA Content of register RA 
SRCB M = 0: Content of register RB 

M = 1: | (Zero-extended to 32 bits) 

DEST Register RC 


23 15 7 0 


31 


OP = 78, 79 © INHW 


Description: 


A half-word in the SRCA operand is selected by the Byte Pointer (BP) 

field of the ALU Status Register. The selected half-word is replaced by 
the low-order half-word of the SRCB operand and the resulting word is 
placed into the DEST location. | 


Note: The selection of haif-words within words is specified in | 
Section 3.3.5.1. : 


Instruction Set | 18-75 


21 amo 


invalidate 


Operation: None 


Assembler 
Syntax: INV 


Status: Not affected 
Operands: Not applicable 


31 23 15 7 . 0 
OP = 9F INV | 


Description: In 29K Family processors with instruction caches, this instruction causes 
all cache valid bits to be reset. In the Am29200 and Am29205 
microcontrollers, this instruction performs no operation, except it is a 
privileged instruction. Attempted execution by a User-mode program 
causes a Protection Violation trap to occur. 


18-76 Instruction Set 


AMD L\ 


IRET | IRET 
Interrupt Return 


Operation: Perform an interrupt return sequence 


Assembler 
Syntax: IRET 


Status: Not affected 
Operands: Not applicable 


31 23 15 7 0 
OP = 88 IRET 


Description: © This instruction performs the interrupt return sequence described in 
3 Section 16.4.4. | 


This instruction may be executed only by Supervisor-mode programs. 
An attempted execution by a User-mode program causes a Protection 
Violation trap to occur. | 


Instruction Set : 18-77 


1 amp | 
IRETINV | — IRETINV 
Interrupt Return and Invalidate | 


Operation: Perform an interrupt return sequence 


Assembler 
Syntax: IRETINV 


Status: Not affected 
Operands: Not applicable 


31 23 15 7 | 0 


| OP =8C IRETINV | 


Description: This instruction performs the interrupt return sequence described in 
Section 16.4.4. In 29K Family processors with an instruction cache, this 
instruction also resets the valid bits in the cache. In the Am29200 and 
Am29205 microcontrollers, this instruction is identical to the IRET 
instruction. 


This instruction may be executed only by Supervisor-mode programs. 
An attempted execution by a User-mode program causes a Protection 
Violation trap to occur. 


18-78 Instruction Set 


AMD fe\ 


JMP JMP 
Jump 


Operation: PC <— TARGET 
Execute delay instruction 


Assembler 
Syntax: JMP target 
Status: Not affected 


Operands: TARGET A=0:117... 110/19 ... l2 (sign-extended to 30 bits) + PC 
A=1:117... 10//I9 ... 12 (zero-extended to 30 bits) | 


31 : 23 15 7 fe) 
OP = AO, At JMP | 


Description: A non- -sequential instruction fetch occurs to the instruction address 
given by the TARGET operand. The instruction following the JMP is 
executed before the eee fetch occurs. 


Instruction Set 18-79 


£1 avo 


JMPF | JMPF 
Jump False | 


Operation: IF SRCA = FALSE THEN PC — TARGET 
Execute delay instruction 


Assembler _ 
Syntax: JMPF ra, target 


Status: Not affected 
Operands: SRCA Content of register RA 


TARGET A=0:117... 110 //1I9 ... 12 (sign-extended to 30 bits) +PC 
A=1:117...110//19 ... 12 (zero-extended to 30 bits) 


31 23 15 7 0. 
OP = A4, A5 JMPF 


Description: If SRCA is a Boolean FALSE, a non-sequential instruction fetch occurs 
to the instruction address given by the TARGET operand. 


lf SRCA is a Boolean TRUE, this instruction has no effect. 


The instruction following the JMPF is executed regardless of the value 
.Of SRCA. 


18-80 Instruction Set 


“” 


JMPFDEC 


Operation: 


Assembler 
Syntax: 


Status: 
Operands: 


—«3t 


AMD cl 


| JMPFDEC 
Jump Faise and Decrement 


IF SRCA = FALSE THEN 
SRCA <- SRCA — 1 
PC <— TARGET 
ELSE 
SRCA < SRCA - 1 
Execute delay instruction 


JMPFDEC ra, target 
Not affected 
SRCA Content of register RA 


TARGET A=0: 117... 110 //19 ... 12 (sign-extended to 30 bits) + PC 
A=1:117... 110 //19 ... 12 (zero-extended to 30 bits) 


23 15 a 0 


OP = B4, B5 a JMPFDEC 


_Description: 


If SRCA is a Boolean FALSE, a non-sequential instruction fetch occurs 
to the instruction address given by the TARGET operand. 


lf SRCA is a Boolean TRUE, this instruction has no effect on the 
instruction-execution sequence. | 


The SRCA operand is decremented by one, regardless of whether or 
not the non-sequential instruction fetch occurs. Note that a negative 
number for the SRCA operand is a Boolean TRUE. 


The instruction following the JMPFDEC is executed regardiess of the 
value of SRCA. 


Instruction Set — 7 18-81 


£4 amp 
JMPFI | | JMPFI 
| Jump False Indirect 
Operation: IF SRCA = FALSE THEN PC — SRCB 
Execute delay instruction 


Assembler 
Syntax: JMPFl ra, rb 


Status: Not affected. 


Operands: SRCA Content of register RA 
SRCB Content of register RB 
31 23 15 . Z 0 
BSAee hak a0i MAAN BA 
OP = C4 JMPFI | 


Description: _ If the SRCA is a Boolean FALSE, a non-sequential instruction fetch 
occurs to the instruction address given by the SRCB operand. 


lf SRCA is a Boolean TRUE, this instruction has no effect. 


The instruction following the JMPFI is executed regardless of the value 
of SRCA. 


18-82 _ | Instruction Set 


AMD it 


JMPI | | JMPI 
Jump Indirect | 
Operation: PC <SRCB 
Execute delay instruction 


Assembler 
Syntax: JMPI-rb 


Status: Not affected — 


Operands: SRCB Content of register RB 
31 | 23 15 7 6) 
11000000 —rasmea | ramnea | me | 


OP =CO | JMPI 


Description: A non-sequential instruction fetch occurs to the instruction address 
given by the SRCB operand. The instruction following the JMPI is 
executed before the non-sequential fetch occurs. 


18-83 


Instruction Set 


amo 
JMPT 


Operation: 


Assembler 
Syntax: 


Status: 
Operands: 


31 


1010 1 


JMPT 
Jump True 


IF SRCA = TRUE THEN PC — TARGET 
Execute delay instruction 


JMPT ra, target 

Not affected 

SRCA Content of register RA | 

TARGET A=0:117... 110/19 ... 12 (sign-extended to 30 bits) + PC 
Az=1:117... 110/19 ... 12 (zero-extended to 30 bits) 


23 15 7 0 


OP = AC, AD JMPT 


Description: 


18-84 


If SRCA is a Boolean TRUE, a non-sequential instruction fetch occurs to. 
the instruction address given by the TARGET operand. 


If SRCA is a Boolean FALSE, this instruction has no effect. 


The instruction following the JMPT is executed regardless of the value 
of SRCA. 3 


Instruction Set 


AMD fe 


JMPTI | | | JMPTI 
_ Jump True Indirect 
Operation: IF SRCA = TRUE THEN PC « SRCB 
Execute delay instruction 


Assembler 
Syntax: JMPTI ra, rb 


Status: Not affected 


Operands: SRCA Content of register RA 
SRCB Content of register RB 
31 | 23 15 7 0 
Oc ea ak All 
| OP = CC JMPTI 


Description: _ If the SRCA is a Boolean TRUE, a non-sequential instruction fetch 
occurs to the instruction address given by the SRCB operand. 


lf SRCA is a Boolean FALSE, this instruction has no effect. 


The instruction following the JMPTI is executed regardless of the value 
of SRCA. © 


Instruction Set 18-85 


el AMD 
LOAD 


Operation: 


Assembler 
Syntax: 


Status: 
Operands: 


31 


LOAD 
Load 


DEST <— EXTERNAL WORD [SRCB] 


LOAD 0, cntl, ra, rb 
or. | 
LOAD 0, entl, ra, const8 


Not affected 


SRCB M = 0: Content of register RB 
M = 1: | (Zero-extended to 32 bits) 


DEST Register RA 


23 15 7 0 


OP = 16, 17 ' LOAD 


Description: 


18-86 


Res 


The external word addressed by the SRCB operand is placed into the 
DEST location. 


The CNTL field of the LOAD instruction affects the bus access as 
described in Section 3.3.1. 


Instruction Set 


AMD ci 


. LOADL | LOADL 
Load and Lock 


Operation: DEST <— EXTERNAL WORD [SRCB] 


Assembler | 
Syntax: LOADL 0, cnitl, ra, rb 
or 
LOADL 0, cntl, ra, const8 


Status: Not affected 


Operands: SRCB M = 0: Content of register RB 
M = 1: | (Zero-extended to 32 bits) 
DEST Register RA | 
31 | 23 15 7 0 
jeoooorrm om | mB 
OP = 06, 07 LOADL 
Res 


Description: The external word addressed by the SRCB operand is placed into the . 
DEST location. | 


The CNTL field of the LOADL instruction affects the bus access as 
described in Section 3.3.1. 


In other 29K Family processors, this instruction is provided for.the 
implementation of interlock protocols. In the Am29200 and Am29205 
microcontrollers, the LOADL instruction is identical to the LOAD 
instruction. 


Instruction Set 18-87 


Ll AMD 
LOADM 


Operation: 


Assembler 
Syntax: 


Status: 
Operands: 


31 


LOADM 
Load Multiple 


DEST... DEST+COUNT <— EXTERNAL WORD [SRCB] ... 
EXTERNAL WORD [SRCB + (COUNT * 4)] 


LOADM 0, cntl, ra, rb 
or | 
LOADM 0, cntl, ra, const8 


Not affected 


SRCB M = 0: Content of register RB 
7 M = 1: | (Zero-extended to 32 bits) 


DEST register RA 


23 15 7 0 


OP = 36, 37 ; LOADM 


Description: 


18-88 


Res 


- External words at consecutive word addresses beginning with the word 


addressed by the SRCB operand, are placed into consecutive registers 
beginning with the DEST location. 


The total number of words accessed in.the sequence is specified by the 
Count Remaining (CR) field of the Channel Control Register (which also 
appears in the Load/Store Count Remaining Register) at the beginning 
of the bus access. The total number of words is the value of the CR field 
plus one. The CNTL field of the LOADM instruction affects the access 
as described in Section 3.3.1. 


Note: The address and register-number sequences for the LOADM 
instruction are specified in Section 3.3.4. Because this instruction uses 
the Channel Address and Control Registers, it should not be executed 
when the FZ bit is 1. 


Instruction Set 


LOADSET 


Operation: 


Assembler 
Syntax: 


‘Status: 
Operands: 


31 


AMD ci 
LOADSET 
Load and Set 


DEST < EXTERNAL WORD [SRCB] 
EXTERNAL WORD [SRCB] <— h'FFFFFFFF’ 


LOADSET 0, cntl, ra, rb 
or 
LOADSET 0, cntl, ra, const8 


Not affected 


SRCB M = 0: Content of register RB 
‘M = 1: | (Zero-extended to 32 bits) 


DEST Register RA 


23 15 7 0 


OP = 26, 27 - LOADSET 


Description: 


Res 


The external word addressed by the SRCB operand is placed into the 
DEST location. After the DEST location is altered, the external word 
addressed by the SRCB operand is written, atomically, with a word 
consisting of a 1 in every bit position. | | 

The CNTL field of the LOADSET instruction affects the bus access as 
described in Section 3.3.1. 


Instruction Set 18-89 


| AMD 
MFSR MFSR 


Move from Special Register 


Operation: - DEST <— SPECIAL 


Assembler 
Syntax: MFSR re, spid 


- Status: Not affected 3 
Operands: SPECIAL Content of special-purpose register SA 


DEST _ Register RC 
31 : 23 15 7 0 
OP =C6 _ -MFSR 


Description: The SPECIAL operand is placed into the DEST location. 


For programs in the User mode, a Protection Violation trap occurs if SA 
specifies a protected special-purpose register. If a trap occurs, the 
DEST location is not altered. | | 


18-90 Instruction Set 


AMD at 


MFTLB MFTLB 
Move from Translation Look-Aside Buffer Register 


Operation: None 


Assembler 
Syntax: MFTLB re, ra 


Status: Not affected 


Operands: SRCA — Content of register RA, bits 6 ... 0 
DEST Register RC 
31 230 | 15 7 0 
TPS AS MAN AS 
OP = B6 MFTLB 


Description: In 29K Family processors with an MMU, this instruction reads TLB 
- entries. In the Am29200 and Am29205 microcontrollers, this instruction 
performs no operation except it is a privileged instruction. Attempted 
execution by a User-mode program causes a Protection Violation trap to 
occur. . 


Instruction Set . 18-91 


Lt AMD 


MTSR | _ MTSR 
Move to Special Register 


Operation: SPDEST<SRCB 


Assembler | 
Syntax: MTSR spid, rb 


Status: Not affected unless the destination is the ALU Status Register 
Operands: SRCB Content of register RB 
SPDEST Special-purpose register SA 


31 ; 23 15 7 0 
OP = CE | MTSR : 


Description: The SRCB operand is placed into the SPECIAL location. 


For programs in the User mode, a Protection Violation trap occurs if SA 
specifies a protected special-purpose register. If a trap occurs, the 
SPDEST location is not altered. 


18-92 Instruction Set 


AMD ct 


MTSRIM | MTSRIM 
Move to Special Register Immediate 


Operation: SPDEST < 0116 


Assembler 7 
Syntax: MTSRIM spid, const16 


Status: Not affected unless the destination is the ALU Status Register 
Operands: 0116 115... 18 // 17 ... 10 (zero-extended to 32 bits) 
SPDEST Special-purpose register SA 


31. 23 15 7 — Q 
OP = 04  MTSRIM 


Description: The 0116 operand is placed into the SPECIAL location. 


For programs in the User mode, a Protection Violation trap occurs if SA 
specifies a protected special-purpose register. If a trap occurs, the 
SPDEST location is not altered. 


Instruction Set be: : 18-93 


(Dl amo 
MTTLB 


Operation: 


Assembler 
Syntax: 


Status: 
Operands: 


31 


| MTTLB 

Move to Translation Look-Aside Buffer Register | 
None 
MTTLB ra, rb 
Not affected 
SRCA Content of register RA, bits 6...0 
SRCB - Content of register RB 

23 | 15 7 0 


OP =BE MTTLB 


Description: 


18-94 


In 29K Family processors with an MMU, this instruction modifies TLB 
entries. In the Am29200 and Am29205 microcontrollers, this instruction 
performs no operation except it is a privileged instruction. Attempted 
execution by a User-mode program causes a Protection Violation trap to 
occur. | 


Instruction Set 


MUL 


Operation: 


_ Assembler 
Syntax: 


Status: 


Operands: | 


31 


AMD ct 


MUL 
Multiply Step 
Perform one-bit step of a multiply operation 
MUL re, ra, rb 
or 
MUL re, ra, const 8 
V,N, Z, C 
SRCA > Content of register RA 
SRCB M = 0: Content of register RB 
M = 1: | (Zero-extended to 32 bits) 
DEST Register RC 
23 15 7 9 


OP = 64, 65 MUL 


Description: 


If the least significant bit of the Q Register is 1, the SRCA operand is 
added to the SRCB operand. If the least significant bit of the Q register . 
is 0, a zero word is added to the SRCB operand. 


The content of the Q Register is appended to the result of the add and 


. the resulting 64-bit value is shifted right by one bit position; the true sign 


of the result of the add fills the vacated bit position (i.e., the sign of the 
result is complemented if an overflow occurred during the add 
operation). The high-order 32 bits of the 64-bit shifted value are placed 
into the DEST location. The low-order 32 bits of the shifted value are 
placed into the Q Register. 


Examples of integer multiply operations appear in Section 2.6.2. 


Instruction Set . 18-95 


a\ AMD 


MULL MULL 
Multiply Last Step 
Operation: Complete a sequence of multiply steps (for signed multiply) 
Assembler i | 
Syntax: MULL re, ra, rb 


| or 
MULL re, ra, const 8 


Status: V,N,Z,C 


Operands: SRCA Content of register RA 
SRCB M = 0: Content of register RB 
M = 1: | (Zero-extended to 32 bits) 
DEST Register RC 
31 23 15 7 0 


OP = 66, 67 | MULL 


Description: _ If the least significant bit of the Q Register is 1, the SRCA operand is 
subtracted from the SRCB operand. If the least significant bit of the Q 
register is 0, a zero word is subtracted from the SRCB operand. 


The content of the Q Register is appended to the result of the subtract 
and the resulting 64-bit value is shifted right by one bit position; the true 
sign of the result of the subtract fills the vacated bit position (i.e., the 
sign of the result is complemented if an overflow occurred during the 
subtract operation). The high-order 32 bits of the 64-bit shifted value are 
placed into the DEST location. The low-order 32 bits of the shifted value 
are placed into the Q Register. 


Examples of integer multiply operations appear in Section 2.6.2. 


18-96 — Instruction Set 


MULTIPLU 


Operation: 


Assembler 
Syntax: 


Status: 
Operands: 


31 


AMD cl 


MULTIPLU 
Integer Multiply, Unsigned 

DEST <— SRCA * SRCB 
MULTIPLU re, ra, rb 
None 
SRCA Content of register RA 
SRCB Content of register RB 
DEST Register RC 

23 15 7 0 


OP = £2. ~ MULTIPLU 


Description: 


The SRCA operand is multiplied by the SRCB operand. The low-order | 
32 bits of the 64-bit result are placed into the DEST location. This 
operation treats the SRCA and SRCB operands as unsigned integers 
and produces an unsigned result. 


The contents of the Q register are undefined after a MULTIPLU 
operation. 


Note: This instruction is not supported directly in processor hardware. In 
the current implementation this instruction causes an MULTIPLU trap. 
When the trap occurs, the IPA, IPB, and IPC registers are set to 
reference SRCA, SRCB, and DEST. 


Instruction Set | 18-97 


21 amp 
MULTIPLY 


Operation: 


Assembler 
Syntax: 


Status: 
Operands: 


37 


MULTIPLY 


Integer Multiply, Signed 

DEST <— SRCA * SRCB 
MULTIPLY re, ra, rb 
None 
SRCA Content of register RA 
SRCB Content of register RB 
DEST Register RC 

23 | 15 | 7 0 


OP = E0 MULTIPLY 


Description: 


18-98 


The SRCA operand is multiplied by the SRCB operand. The low-order 
32 bits of the 64-bit result are placed into the DEST location. This 
operation treats the SRCA and SRCB operands as two’s-complement 
integers and produces a two’s-complement result. 


The contents of the Q register are undefined after a MULTIPLY 
operation. 


Note: This instruction is not supported directly in processor hardware. In 
the current implementation this instruction causes a MULTIPLY trap. 
When the trap occurs, the IPA, IPB, and IPC registers are set to 
reference SRCA, SRCB, and DEST. 


Instruction Set 


MULTM 


Operation: 


Assembler 
Syntax: 


Status: 
Operands: 


31 


AMD cl 


MULTM 
Integer Multiply most significant Bits, Signed 

DEST — SRCA * SRCB 
MULTM re, ra, rb 
None 
SRCA Content of register RA 
SRCB Content of register RB 
DEST Register RC 

23 45 7 0 


OP = DE | MULTM 


~ Description: 


The SRCA operand is multiplied by the SRCB operand. The high-order 
32 bits of the 64-bit result are placed into the DEST location. This 
operation treats the SRCA and SRCB operands as two’s-complement 
integers and produces a two’s-complement result. 


The contents of the Q register are undefined after a MULTM operation. 
Note: This instruction is not supported directly in processor hardware. In 


the current implementation this instruction causes a MULTM trap. When 


the trap occurs, the IPA, IPB, and IPC registers are set to reference 
SRCA, SRCB, and DEST. 


Instruction Set 18-99 


ct AMD 


MULTMU | MULTMU 
Integer Multiply most significant Bits, Unsigned 


Operation: DEST «— SRCA* SRCB 


Assembler 
Syntax: MULTMU rc, ra, rb 


Status: None 


Operands: SRCA Content of register RA 
SRCB Content of register RB - 
DEST Register RC 
31 | 23 15 | 7 0 
prover] ome fom | me 
OP = DF MULTMU — | 


Description: The SRCA operand is multiplied by the SRCB operand. The high-order 
32 bits of the 64-bit result are placed into the DEST location. This 
operation treats the SRCA and SRCB operands as unsigned integers 
and produces an unsigned result. . 


The contents of the Q register are undefined after a MULTMU operation. 


Note: This instruction is not supported directly in processor hardware. In 
the current implementation this instruction causes an MULTMU trap. 
When the trap occurs, the IPA, IPB, and IPC registers are set to 
reference SRCA, SRCB, and DEST. 


18-100 Instruction Set 


MULU 


Operation: 


Assembler 
Syntax: 


Status: 
Operands: 


31 


AMD te 
MULU — 


Multiply Step, Unsigned 
‘Perform one-bit step of a multiply operation (unsigned) 
-MULU re, ra, rb 
or 
MULU re, ra, const 8 
V, N, Z, C : 3 
SRCA Content of register RA 
SRCB M=0: Content of register RB 
_  M=1:1 (Zero-extended to 32 bits) 
DEST Register RC 
23 15 | 7 0 


OP=74,75 — MULU 


Description: 


If the least significant bit of the Q Register is 1, the SRCA operand is 
added to the SRCB operand. If the least significant bit of the Q register 
is O, a zero word is added to the SRCB operand. 


The content of the Q register is appended to the result of the add and 
the resulting 64-bit value is shifted right by one bit position; the carry-out 
of the add fills the vacated bit position. The high-order 32 bits of the 
64-bit shifted value are placed into the DEST location. The low-order 32 
bits of the shifted value are placed into the Q Register. 


Examples of integer multiply operations appear in Section 2.6.2. 


Instruction Set 18-101 


£1 amo 


NAND | | | - NAND 
NAND Logical | 
Operation: DEST <— ~(SRCA & SRCB) 
Assembler 
Syntax: NAND re, ra, rb 
or 
NAND re, ra, const8 
Status: N,Z 
Operands: SRCA Content of register RA 
SRCB M = 0: Content of register RB 
M = 1: | (Zero-extended to 32 bits) 
DEST Register RC 
31 23 15 | 7 0 
OP = 9A, 9B | NAND 


Description: The SRCA operand is logically ANDed, bit-by-bit, with the SRCB 
operand. The one’s-complement of the result is placed into the DEST 
location. 


18-102 Instruction Set 


NOR 


Operation: 


Assembler 
- Syntax: 


Status: 
Operands: 


31 


AMD cl 


NOR 
NOR Logical 
DEST <— ~(SRCA | SRCB) 
NOR re, ra, rb 
or . 
NOR re, ra, const8 
N, Z 
SRCA Content of register RA 
SRCB M = 0: Content of register RB 
M = 1: | (Zero-extended to 32 bits) 

DEST ‘Register RC_ 

23 . 15 7 | 0 


OP = 98, 99 NOR 


Description: 


The SRCA operand is logically ORed, bit-by-bit, with the SRCB 
operand. The one’s-complement of the result is placed into the DEST 


location. © 


Instruction Set 18-103 


1 aw 
OR 


Operation: 


Assembler: 


Syntax: 


Status: 
Operands: 


31 


OR 
OR Logical 

DEST — SRCA | SRCB 
OR re, ra, rb 
or | 
OR re, ra, const8 
N,Z 
SRCA Content of register RA 
SRCB M = 0: Content of register RB 
| M = 1: | (Zero-extended to 32 bits) 
DEST Register RC 

— 23 15 7 0 


OP = 92, 93 OR 


Description: 


18-104 


The SRCA operand is logically ORed, bit-by-bit, with the SRCB 
operand, and the result is placed into the DEST location. 


Instruction Set 


AMD & 


SETIP . SETIP 
Set Indirect Pointers 


Operation: Load IPA, IPB, and IPC registers with operand-register numbers 


Assembler | 
Syntax: SETIP re, ra, rb 


Status: Not affected | 
Operands: Absolute-register numbers for registers RA, RB, and RC 


31 23 15 7 0 


OP = 9E SETIP 


Description: The IPA, IPB, and IPC registers are set to the register numbers of 
registers RA, RB, and RC, respectively. 


For programs in the User mode, a Protection Violation trap occurs if RA, 
RB, or RC specifies a register protected by the Register Bank Protect 
Register. 


Note: This instruction has a delayed effect on the indirect BpInet 
registers as discussed in Section 5. 6. 





Instruction Set 18-105 


a\ AMD 
SLL 


Operation: 


Assembler 
Syntax: 


Status: 
Operands: 


31 


SLL 
Shift Left Logical 
DEST — SRCA << SRCB (zero fill) 
SLL re, ra, rb 
or 
SLL re, ra, const8 
Not affected | 
SRCA Content of register RA 
SRCB ~Me=0: Content of register RB, bits 4... 0 
M = 1:1, bits 4...0 
DEST Register RC 
23 15 7 0 


OP = 80, 81 | SLL 


Description: 


18-106 


The SRCA operand is shifted left by the number of bit positions 
specified by the SRCB operand; zeros fill vacated bit positions. The 
result is placed into the DEST location. 


Instruction Set 


AMD cl 


SQRT | SQRT 
Floating-Point Square Root 
Operation: DEST — SQRT(SRCA) 
Assembler 
Syntax: SQRT rc, ra, FS 
Status: fpX, fpR, foN 
Operands: SRCA Content of register RA (single-precision floating-point) 
: or 
.Content of register RA and the twin of register RA 
| (double-precision floating-point) 
DEST Register RC (single-precision floating-point) 
or 
Register RC and twin of Register RC 
(double-precision floating-point) 
Control: FS Format of source operand SRCA 
00 Reserved for future use 
01 Single-precision floating-point 
10 Double-precision floating-point 
11 Reserved for future use 
31 23 15 7 0 
11100101 RC RA Reserved | FS 
OP =E5 | SQRT 
Description: This operation computes the square root of floating-point operand 


SRCA; the result is rounded according to the FRM field of the 
Floating-Point Environment Register and placed into the DEST location. 
The operand and result are single- or double-precision floating-point 
numbers as specified by FS. , 


Note: This iristruction is not supported directly in processor hardware. In 


the current implementation, this instruction causes an SQRT trap. When 
the trap occurs, the IPA and IPC registers are set to reference SRCA 
and DEST, and the IPB Register is set with the value of the FS field. 


instruction Set . 18-107 


£1 amp 
SRA 


Operation: 


Assembler 
Syntax: 


Status: 
Operands: 


31 


SRA 
Shift Right Arithmetic 


DEST — SRCA >> SRCB (sign fill) 


SRA re, ra, rb 
or 
SRA rc, ra, const8 


Not affected 
SRCA Content of register RA 


SRCB M = 0: Content of register RB, bits 4.... 0 
M = 1:1, bits 4...0 


DEST Register RC 


23 15 7 0 


OP = 86, 87 SRA 


Description: 


18-108 


The SRCA operand is shifted right by the number of bit positions 
specified by the SRCB operand; the sign of the SRCA operand fills 
vacated bit positions. The result is placed into the DEST location. 


Instruction Set 


SRL 


Operation: 


Assembler 
Syntax: 


Status: 
Operands: 


31 


AMD Lt 


SRL 
Shift Right Logical 
DEST <— SRCA >> SRCB (zero fill) 
SRL re, ra, rb — 
or 
SRL re, ra, const8 
Not affected 
SRCA Content of register RA 
SRCB M = 0: Content of register RB, bits 4... 0 
 M=1:1, bits 4...0 

DEST Register RC 

23 15 Z 0 


OP = 82, 83 | SRL 


Description: 


The SRCA operand is shifted right by the number of bit positions 
specified by the SRCB operand; zeros fill vacated bit positions. The 


_ result is placed into the DEST location. 


_Instruction Set _ . | 18-109 


oN amp 


STORE ; STORE 
Store 


Operation: EXTERNAL WORD [SRCB] <— SRCA 


Assembler 
Syntax: STORE 0, cnitl, ra, rb 
or 
STORE 0, cntl, ra, const8 


Status: Not affected 


Operands: SRCA Content of register RA 
SRCB M = 0: Content of register RB 
M = 1: | (Zero-extended to 32 bits) 
31 ° 23 15 7 ) 
BEORORE ACil BANA Rec 
OP =1E, 1F STORE 
Res . 


Description: The SRCA operand is placed into the external word addressed by the 
SRCB operand. 


The CNTL field of the STORE instruction affects the bus access as 
described in Section 3.3.1. - 


18-110 Instruction Set 


STOREL 


Operation: 


Assembler 
Syntax: 


Status: 
Operands: 


31 


AMD te 


STOREL 
Store and Lock 
EXTERNAL WORD [SRCB] <— SRCA 
STOREL 0, cnitl, ra, rb 
or 
STOREL 0, cnil, ra, const8 
Not affected 
SRCA Content of register RA 
SRCB M = 0: Content of register RB 
M = 1: 1 (Zero-extended to 32 bits) 
23 15 7 ) 


OP =0E, OF ; STOREL 


Description: 


Res 


The SRCA operand is placed into the external word addressed by the | 
SRCB operand. 

The CNTL field of the STOREL instruction affects the bus access as 
described in Section 3.3.1. 

In other 29K Family processors, this instruction is provided for the 
implementation of interlock protocols. In the Am29200 and Am29205 
microcontrollers, the STOREL instruction is identical to the STOR 
instruction. | 


Instruction Set 18-111 


o1 ao 


STOREM | ~ STOREM 
Store Multiple 


Operation; EXTERNAL WORD [SRCB] ... EXTERNAL WORD 
[SRCB + (COUNT * 4)] 
< SRCA... SRCA + COUNT 


Assembler 
Syntax: STOREMO, cnil, ra, rb 
. or 
STOREM 0, cntl, ra, const8 


Status: Not affected 


Operands: SRCA Content of register RA 
| SRCB M = 0: Content of register RB 
M = 1: | (Zero-extended to 32 bits) 
31 23 15. 7 0 
BEOnEnnc eae Se Ae Re 
OP = 3E, 3F ; STOREM 
Res 


Description: The contents of consecutive registers, beginning with the SRCA | | 
operand, are placed into external words at consecutive word addresses, 
beginning with the word addressed by the SRCB operand. 


The total number of words accessed in the sequence is specified by the 
Count Remaining (CR) field of the Channel Contro! Register (which also 
appears in the Load/Store Count Remaining Register) at the beginning 
of the bus access. The total number of words is the value of the CR field 
plus one. The CNTL field of the STOREM instruction affects the access 
as described in Section 3.3.1. 


Note: The address and register-number sequences for the STOREM 
instruction are specified in Section 3.3.4. Because this instruction uses 
the Channel Address, Data, and Control Registers, it should not be 
executed when the FZ bit is 1. 


18-112 , Instruction Set 


Amp &A 


SUB SUB 
Subtract 
Operation: DEST ~— SRCA-—SRCB 
Assembler 
Syntax: SUB re, ra, rb 
or 
SUB rc, ra, const8 
Status: V,N,Z,C 
Operands: SRCA Content of register RA 
SRCB M = 0: Content of register RB 
M = 1: | (Zero-extended to 32 bits) 
DEST Register RC 7 
31 23 15 : 7 ) 
OP = 24, 25 SUB 


The SRCA operand is added to the two’s-complement of the SRCB 


Description: 
operand and the result is placed into the DEST location. 


Instruction Set 18-113 


01 ano 


SUBC | SUBC 
Subtract with Carry 


Operation: DEST <SRCA-SRCB-1+C 


Assembler 
Syntax: SUBC re, ra, rb 
or 
SUBC rc, ra, const8 


Status: V,N,Z,C 


Operands: SRCA Content of register RA 
SRCB M = 0: Content of register RB . 
M = 1: | (Zero-extended to 32 bits) 
DEST Register RC 
31 23 15 - 7 0 
OP = 2C, 2D SUBC 


Description: The SRCA operand is added to the one’s-complement of the SRCB 
operand and the value of the ALU Status Carry bit. The result is placed 


into the DEST location. 


18-114 Instruction Set 


SUBCS 


Operation: 


Assembler 
Syntax: 


Status: 
Operands: 


31 


AMD cl 


SUBCS 
Subtract with Carry, Signed 


DEST — SRCA—-SRCB-1+C : 
IF signed overflow THEN Trap (Out of Range) 


SUBCS re, ra, rb 
or 
SUBCS rc, ra, const8 


V,N, Z, C 
SRCA Content of register RA 


SRCB M = 0: Content of register RB 
M = 1: | (Zero-extended to 32 bits) 


DEST Register RC 


23 15 7. 0 . 


OP = 28, 29 SUBCS 


Description: 


The SRCA operand is added to the one’s-complement of the SRCB 
operand and the value of the ALU Status Carry bit. The result is placed 
into the DEST location. 


If the add operation causes a two’s-complement signed overflow, an 
Out-of-Range trap occurs. 


Note that the DEST location is altered whether or not an overflow 
occurs. 
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64 amo 


SUBCU | SUBCU 
Subtract with Carry, Unsigned 


Operation: DEST <— SRCA-—SRCB-1+C 
| IF unsigned underflow THEN Trap (Out of Range) 


Assembler | 
Syntax: SUBCU rc, ra, rb 
or | 
SUBCU re, ra, const8 


Status: V,N,Z,C 


Operands: SRCA Content of register RA 
SRCB M = 0: Content of register RB 
M = 1: | (Zero-extended to 32 bits) 
DEST Register RC 
3 23 15 7 0 
OP = 2A, 2B SUBCU 


Description: The SRCA operand is added to the one’s-complement of the SRCB 
operand and the value of the ALU Status Carry bit. The result is placed 
into the DEST location. 


If the add operation causes an unsigned underflow, an Out-of-Range 
trap occurs. 


Note that the DEST location is altered whether or not an underflow 
occurs. : | 


18-116 Instruction Set 
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SUBR SUBR 
Subtract Reverse 
Operation: DEST — SRCB-SRCA 
Assembler 
Syntax: SUBR rc, ra, rb 
or 
SUBR re, ra, const8 
Status: V,N,Z,C | 
Operands: SRCA Content of register RA 
SRCB M = 0: Content of register RB 
M = 1: | (Zero-extended to 32 bits) 
DEST Register RC 
31 23 15 7 0 


OP = 34, 35 


Description: 


SUBR 


The SRCB operand is added to the two’s-complement of the SRCA 
operand and the result is placed into the DEST location. 
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SUBRC -SUBRC 
Subtract Reverse with Carry 


Operation: DEST <— SRCB-—-SRCA-1+C 


Assembler 
Syntax: SUBRC rec, ra, rb 
or 
SUBRC re, ra, const8 


Status: V,N,Z,C 


Operands: SRCA _ Content of register RA 
SRCB M = 0: Content of register RB 
M = 1: | (Zero-extended to 32 bits) 
DEST Register RC 
31 23 15 7 0 


OP = 3C, 3D SUBRC 


Description: The SRCB operand is added to the one’s-complement of the SRCA 
operand and the value of the ALU Status Carry bit. The eeu) is placed 
into the DEST location. 


18-118 Instruction Set 


SUBRCS 


Operation: © 


Assembler . 


Syntax: 


Status: 
Operands: 


31 
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SUBRCS 
Subtract Reverse with Carry, Signed 


DEST < SRCB-SRCA-1+C 
IF signed overflow THEN Trap (Out of Range) 


SUBRCS re, ra, rb 
or 
SUBRCS rc, ra, const8 


V, N, Z, C 
SRCA ~ Content of register RA 


SRCB M = 0: Content of register RB 
| M = 1: | (Zero-extended to 32 bits) 


DEST Register RC 


23 | 15 7 0 


OP = 38, 39 SUBRCS 


Description: 


The SRCB operand is added to the one’s-complement of the SRCA 
operand and the value of the ALU Status Carry bit. The result is placed 
into the DEST location. 


If the add operation causes a two’s-complement signed overflow, an 
Out-of-Range trap occurs. 


Note that the DEST location is altered whether or not an overflow 
occurs. 
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SUBRCU , | SUBRCU 
Subtract Reverse with Carry, Unsigned 


Operation: DEST <—SRCB-SRCA-1+C 
IF unsigned underflow THEN Trap (Out of Range) 


Assembler 
Syntax: SUBRCU Ic, ra, rb 
or 
SUBRCU re, ra, const8 


Status: V,N,Z,C 


Operands: SRCA Content of register RA 
SRCB M =0: Content of register RB» 
M = 1: I (Zero-extended to 32 bits) 
DEST Register RC 
31 23 15 _ 7 0 
OP = 3A, 3B | SUBRCU | 


Description: The SRCB operand is added to the one’s-complement of the SRCA 
operand and the value of the ALU Status Carry bit. The result is placed 
into the DEST location. | 


If the add operation causes an unsigned underflow, an Out-of-Range 
trap occurs. 


Note that the DEST location is altered whether or not an underflow 
occurs. 
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SUBRS | | -SUBRS 
Subtract Reverse, Signed | 


Operation: DEST < SRCB-—SRCA 
IF signed overflow THEN Trap (Out of Range) 


Assembler 
Syntax: SUBRS rc, ra, rb 
or | 
SUBRS re, ra, const8 


Status: V,N,Z,C 


Operands: SRCA Content of register RA 
SRCB _ M=0: Content of register RB 
M = 1: | (Zero-extended to 32 bits) 
DEST - Register RC 
31 23 15 7 0 
OP = 30, 31 | SUBRS 


Description: The SRCB operand is added to the two’s-complement of the SRCA | 
operand and the result is placed into the DEST location. If the add 
operation causes a two’s-complement signed overflow, an Out-of-Range 
trap occurs. 


Note that the DEST location is altered whether or not an overflow 
occurs. 
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SUBRU | 7 | SUBRU 
Subtract Reverse, Unsigned 
Operation: DEST<~SRCB-SRCA  ——~™ 
IF unsigned underflow THEN Trap (Out of Range) 


Assembler 
Syntax: SUBRU rc, ra, rb 
or | 
SUBRU re, ra, const8 


Status: V,N,Z,C 


Operands: SRCA Content of register RA 
SRCB M = 0: Content of register RB 
M = 1: | (Zero-extended to 32 bits) 
DEST Register RC 
31 230 15 7 0 
| OP = 32, 33 : _ SUBRU 


Description: The SRCB operand is added to the two’s-complement of the SRCA 
Operand and the result is placed into the DEST location. If the add 
_ operation causes an unsigned underflow, an Out-of-Range trap occurs. 


Note that the DEST location is altered whether or not an underflow 
Occurs. | 


18-122 | ~ Instruction Set 


SUBS 


Operation: 


Assembler 
Syntax: 


Status: 
Operands: 


31 


AMD &A | 


SUBS 
Suen Signed 
DEST — SRCA — SRCB 
IF signed overflow THEN Trap (Out of Range) 
SUBS rc, ra, rb 
or 
SUBS rc, ra, const8 
V,N, Z, C 
SRCA Content of register RA 
SRCB M = 0: Content of register RB 
M = 1: | (Zero-extended to 32 bits) 
DEST Register RC | 
23 15 7 0 


0010000M RC RA | RB or | 


OP = 20, 21 7 SUBS 


Description: 


The SRCA operand is added to the two’s-complement of the SRCB 
operand and the result is placed into the DEST location. If the add 
operation causes a two’s-complement signed overflow, an Out-of-Range 


trap occurs. 
Note that the DEST location is altered whether or not an overflow | 
occurs. 
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SUBU a SUBU 
Subtract, Unsigned 


Operation: DEST <SRCA-SRCB 
IF unsigned underflow THEN Trap (Out of Range) 


Assembler 
Syntax: SUBU re, ra, rb 
or 
SUBU re, ra, const8 


Status: V,N,2Z,C 


Operands: SRCA Content of register RA 
SRCB M = 0: Content of register RB 
M= 1:1 (Zero-extended to 32 bits) 
DEST Register RC 
31 ——- 23 15 | 7 0 


OP = 22, 23 SUBU 


Description: The SRCA operand is added to the two’s-complement of the SRCB _ 
operand and the result is placed into the DEST location. If the add 
operation causes an unsigned underflow, an Out-of-Range trap occurs. 


Note that the.DEST location is altered whether or not an underflow 
occurs. 
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XNOR . | XNOR 
Exclusive-NOR Logical 


Operation: DEST < ~(SRCA“ SRCB) 


Assembler 
Syntax: XNOR ec, ra, rb 
or 
XNOR re, ra, const8 
Status: N, Z 
Operands: SRCA Content of register RA 
SRCB~ M = 0: Content of register RB 
M = 1: | (Zero-extended to 32 bits) 
DEST Register RC 
31 23 15 7 0 
ROBO! AAPA MAREN MAST 
OP = 96, 97 | XNOR 


Description: The SRCA operand is logically exclusive-ORed, bit-by-bit, with the 
SRCB operand. The one’s-complement of the result is placed into the 
DEST location. | 
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XOR 


Operation: 


Assembler 
Syntax: 


Status: 
Operands: 


31 


XOR 
Exclusive-OR Logical 
DEST «+ SRCA“ SRCB 
XOR re, ra, rb 
or 
XOR re, ra, const8 
N,Z 
SRCA Content of register RA 
SRCB -M=0O: Content of register RB 
| M = 1: | (Zero-extended to 32 bits) 
DEST Register RC 
23 15 7 0 


OP = 94, 95 ~ XOR 


Description: 


18-126 


The SRCA operand is logically exclusive-ORed, bit-by-bit, with the 
SRCB operand, and the result is placed into the DEST location. 
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INSTRUCTION INDEX BY OPERATION CODE 


01 

02 

03 

04 
06,07 
08,09 
0A,0B 
0C,0D 
OE,0F 
10,11 
12,13 
14,15 
16,17 


— 18,19 


1A,1B 
10,1D 
1E,1F 
20,21 
22,23 
24,25 
26,27 
28,29 
2A,2B 
2C,2D 
2E,2F 
30,31 
32,33 
34,35 
36,37 
38,39 
3A,3B 
3C,3D 
3E,3F 


40,41 


42,43 


44,45 


46,47 
48,49 
4A,4B 
4C,4D 
4E,4F 
50,51 
52,53 
54,55 
56,57 


CONSTN. 
CONSTH 
CONST 
MTSRIM 
LOADL 
CLZ 
EXBYTE 
INBYTE 
STOREL 
ADDS 
ADDU 
ADD 
LOAD 
ADDCS 
ADDCU 
ADDC 
STORE 
SUBS 
SUBU 
SUB 
LOADSET 
SUBCS 
SUBCU 
SUBC 
CPBYTE 
SUBRS 
SUBRU 


- SUBR 


LOADM 
SUBRCS 
SUBRCU 
SUBRC 
STOREM 
CPLT 
CPLTU 
CPLE 
CPLEU 
CPGT 
CPGTU 
CPGE 
CPGEU 
ASLT 


~ASLTU 


ASLE 
ASLEU 


Constant, Negative 
Constant, High 

Constant 

Move to Special Register Immediate 
Load and Lock 

Count Leading Zeros 
Extract Byte 

Insert Byte 

Store and Lock 

Add, Signed 

Add, Unsigned 

Add 

Load 

Add with Carry, Signed 
Add with Carry, Unsigned 
Add with Carry © 

Store 

Subtract, Signed 

Subtract, Unsigned 
Subtract 

Load and Set 

Subtract with Carry, Signed 
Subtract with Carry, Unsigned 
Subtract with Carry 


‘Compare Bytes 


Subtract Reverse, Signed — 

Subtract Reverse, Unsigned 

Subtract Reverse | 

Load Multiple 

Subtract Reverse with Carry, Signed 
Subtract Reverse with Carry, Unsigned 
Subtract Reverse with Carry 

Store Multiple 

Compare Less Than 

Compare Less Than, Unsigned 

Compare Less Than or Equal To . 
Compare Less Than or Equal To, Unsigned 
Compare Greater Than 

Compare Greater Than, Unsigned 
Compare Greater Than or Equal To 
Compare Greater Than or Equal To, Unsigned 
Assert Less Than 

Assert Less Than, Unsigned . 

Assert Less Than or Equal To 

Assert Less Than or Equal To, Unsigned 
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18-128 


58,59 
5A,5B 
5C,5D 
5E,5F 
60,61 
62,63 
64,65 
66,67 
68,69 


6A,6B 


6C,6D 
6E,6F 
70,71 
72,73 
74,75 
78,79 


7A,7B | 


7C,7D 
7E 
80,81 
82,83 
86,87 
88 
89 
8C 
90,91 
92,93 
94,95 
96,97 
98,99 
9A,9B 
9C,9D 
9E 
OF 
AO,A1 
A4,A5 
A8,A9 
AC,AD 
B4,B5 
B6 
BE | 
co 
C4 
C6 
C8 
CC 


ASGT 
ASGTU 
ASGE 
ASGEU 
CPEQ 
CPNEQ 
MUL 
MULL 
DIVO 
DIV 
DIVL 
DIVREM 
ASEQ 
ASNEQ 
MULU 
INHW 
EXTRACT 
EXHW 
EXHWS 
SLL 
SRL 
SRA 
IRET 
HALT 
IRETINV 
AND 
OR 
XOR | 
XNOR 
NOR 
NAND 
ANDN 
SETIP 
INV 
JMP 
JMPF 
CALL 
JMPT 
JMPFDEC 
MFTLB 
MTTLB 
JMPI 
JMPFI 
MFSR 
CALLI 
JMPTI 


Assert Greater Than 

Assert Greater Than, Unsigned 

Assert Greater Than or Equal To 

Assert Greater Than or Equal To, Unsigned 
Compare Equal To 

Compare Not Equal To 

Multiply Step | 

Multiply Last Step 

Divide Initialize 

Divide Step 

Divide Last Step 

Divide Remainder 

Assert Equal To 

Assert Not Equal To 

Multiply Step, Unsigned 

Insert Half-Word 

Extract Word, Bit-Aligned 

Extract Half-Word 

Extract Half-Word, Sign-Extended 

Shift Left Logical 

Shift Right Logical 

Shift Right Arithmetic 
Interrupt Return 

Enter HALT Mode 

Interrupt Return and Invalidate 
AND Logical 

OR Logical 

Exclusive-OR Logical 
Exclusive-NOR Logical 
NOR Logical 

NAND Logical 

AND-NOT Logical 

Set Indirect Pointers 
Invalidate 

Jump 

Jump False 

Call Subroutine 

Jump True 

Jump False and Decrement 
Move from Translation Look-Aside Buffer Register 
Move to Translation Look-Aside Buffer Register 
Jump Indirect 

Jump False Indirect 

Move from Special Register 

Call Subroutine, Indirect 

Jump True Indirect 


Instruction Set 


CE 

D7 
D8—DD 
DE 

DF 

EO 

E1 

E2 

E3 

E4 

E5 

E6 
E7-E9 
EA | 
EB 

EC 

ED 

EE 


Er 


FO 

Ft 
F2 

F3 

F4 

F5 

F6 
-F7 

F8 
F9 
FA-FF 
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MTSR Move to Special Register 
EMULATE Trap to Software Emulation Routine 
Reserved for emulation (trap vector numbers 24-29) 
MULTM Integer Multiply Most Significant Bits, Signed 
MULTMU Integer Multiply Most Significant Bits, Unsigned 
MULTIPLY Integer Multiply, Signed 
DIVIDE Integer Divide, Signed 
MULTIPLU Integer Multiply, Unsigned 
DIVIDU Integer Divide, Unsigned 
CONVERT Convert Data Format 
SQRT Square Root 
CLASS Classify Floating-Point Operand 
Reserved for emulation (trap vector number 39-41) 
FEQ Floating-Point Equal To, Single-Precision _ 
DEQ Floating-Point Equal To, Double-Precision 
FGT Floating-Point Greater Than, Single-Precision 
DGT Floating-Point Greater Than, Double-Precision 
FGE Floating-Point Greater Than or Equal To, 

: | Single-Precision | 
DGE Floating-Point Greater Than or Equal To, 

Double-Precision 

FADD Floating-Point Add, Single-Precision 
DADD Floating-Point Add, Double-Precision 
FSUB Floating-Point Subtract, Single-Precision 
DSUB Floating-Point Subtract, Double-Precision 
FMUL Floating-Point Multiply, Single-Precision 
DMUL Floating-Point Multiply, Double-Precision 
FDIV Floating-Point Divide, Single-Precision 
DDIV | Floating-Point Divide, Double-Precision 
Reserved for emulation (trap vector number 56) 
FDMUL Floating-Point Multiply, Single-to-Double-Precision 


Reserved for emulation (trap vector numbers 58-63) 
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SPECIAL SETTINGS FOR THE é- 
Am29200 AND Am29205 MICROCONTROLLERS 





Am29200 MICROCONTROLLER 


Before using the Am29200 microcontroller product, the user should prepare the 
microcontroller by setting the following signals as shown. 


m Tie the TRST signal to RESET, whether or not the JTAG port will be used. 


m If the JTAG port will not be used, drive TCK, TMS, and TDI to known states, prefer- 
ably through pull-up resistors. Although these signals have weak internal active pull- 
ups, tying them to a known signal level will eliminate any system noise from being 
coupled into the JTAG interface. 


& If the serial port will not be used, tie the UCLK signal High. 


Am29205 MICROCONTROLLER 


Before using the Am29205 microcontroller product, the user should prepare the 
microcontroller by setting the following signals and fields as shown. 


m Pull the WAIT/TRIST signal High. 
m lf the serial port will not be used, tie the UCLK signal High. 


= Program the ASEL3 field in the ROM Configuration Register with a value that does 
not overlap with addresses specified for ROM Banks 0 through 2. 


In the PIA Control Register 0, write bits 15-0 with Os. 

In the PIO Control Register, write bits 7—O with Os. 

In the PIO Output Register, write bits 7—0O with Os. 

In the PIO Output Enable Register, write bits 7—O with Qs. 
@ Inthe Parallel Port Control Register, set the FWT bit to 0. 


Note that all registers with fields designated as “reserved” should be programmed with 
Os to ensure compatibility. 
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Figure B-1 General-Purpose Register Organization 


Absolute 

Register # General Purpose 
aa Indirect Pointer Access 
a Stack Pointer | 


2-63 Not Implemented 







Global Register 64 
Global Register 65 


Global Register 66 


Global 


Registers 
31 8 210 


rsp 
Global Register 126 
131 
. (example) 
Global Register 127 


ee Local Register 125 
po Local Register 126 


po Local Register 127 
pt Local Register 0 | 
ae Local Register 1 












See Section 4.1.3 for more 
detail on the operation of 
the register stack pointer 


(rsp). 


Local 
Registers 






Local Register 123 
Local Register 124 
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Figure B-2 


Register Bank Organization 




















Register 
Bank Protect Absolute-Register General-Purpose 
Register Bit Numbers Registers 


48 through 63 

















ee 


240 through 255 Bank 15 
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Figure B-3 Special-Purpose Registers 


REG # 
31 23 15 7 


| 
wwe ete) af else 


oO 


So 
< 
ee) 

zm 


Vector Area Base Address (VAB) 
Page 16-5 . | 


1 oe) 


ak 
. 
i 


1 23 


15 7 
resent tT EEE LL 
’ : (7 ae ae 


Old Processor Status (OPS) aa 
Page 16- TD Res, TE;TU, Res, Res, IM 


i] i] 
IP TP FZ WM SM _ ODI 


oO 


io] 


1 23 


15 7 
2s ha ha 


Current Processor Status (CPS) : 
Page 16-1 TD Res; TE, TU, Res, Res, IM 


6 
] 
L] 
‘ ‘ - 6 
IP TP FZ WM SM sD 


— 0 


} 
fed 


31 23 15 7 0 
3 . 
Configuration (CFG) 
Page 2-26 
31 23 15 7 0 
: CHA 
\ Channel Address (CHA) 
Page 16-18 
31 23 15 7 0 
> CHD 


Channel Data (CHD) 
Page 16-18 


23 | 


. 15 7. 
Sa a 
Channel Control! (CHC) © As. ot 


Res Page 16-19 LS: ST Res _ NN: 
ML CV 


o 
-.{_]2 
as eed 
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Special-Purpose Registers (continued) 


REG # 





7 
eeeaeeaoeeaoeoeoeoeee eo 
31 23 | 15 7 
Timer Counter (TMC) — 
Page 16-22 
81 23 15 7 
° | esenes | | | TAY 
» ' t 
oie 
OV, IE 
IN 
Timer Reload (TMR) 
Page 16-23 
31 23 15 7 
my PCO 
Program Counter 0 (PCO) 
Page 16-9 
31 23 15 7 
" PC 
Program Counter 1 (PC1) 
Page 16-9 
31 23 15 7 
12 PC2 
_ Program Counter 2 (PC2) 
Page 16-10 
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Figure B-3 Special-Purpose Registers (continued) 


REG # 


128 


129 


130 


131 


132 


133 


31 23 15 7 0 
Be 
Indirect Pointer C (IPC) 

Page 2-13 

31 23 15 - 7 


0 
Reserved IPA | o 


indirect Pointer A (IPA) 
Page 2-13 | , 


31 23 15 7 


0 
every rearerenroongrery 


Indirect Pointer B (IPB) 


Page 2-14 
31 23 15 7 0 
Q(Q) 
Page 2-20 . , 
31 23 | 15 7 0 
Perera 
ALU Status (ALU) : ep ap: 
Page 2-16 DE:-N'C 

V Z BP 


23 15 7 Le) 


Byte Pointer (BP) 
Page 3-3 
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Special-Purpose Registers (continued) 


REG # 


31 23 15 7 


0 
1 Jolofolofo}ololofofofofofofololaofololofolelololalofol ro 


Funnel Shift Count (FC) 
Page 3-3 


31 23 15 7 


0 
125 Joo ofoofofofofofofofo}ofofofofofofofofelofo] ce 


160 


Load/Store Count Remaining (CR) 
Page 3-11 


31 23 15 


- 
peer ra 


“L_Jo 


Floating-Point Environment (FPE) 


_ Page 2-14 FF DM UM! RM 


0 
161 
: Reserved 


162 


XM VM NM 
Note: this is a virtual register not implemented directly in hardware 


31 23 15 7 


Integer Environment (INTE) 


Page 2-15 DO 
MO 

Note: this is a virtual register not implemented directly in hardware 

31 23 15 7 0 


Reserved 





Floating-Point Status (FPS) — | ee ae a ee 
Page 2-18 . DT, UT, RT, DS, US. RS, 
XT VT NT XS VS NS 


Note: this is a virtual register not implemented directly in hardware 
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Table B-1 Processor Register Field Summary 
Label Field Name Register Bit 
BO | Bank 0 Protection Bit Register Bank Protect 0 
Bt Bank 1 Protection Bit Register Bank Protect 1 
B2 Bank 2 Protection Bit Register Bank Protect 2 
B3 Bank 3 Protection Bit Register Bank Protect 3 
B4 Bank 4 Protection Bit Register Bank Protect 4 
B5 Bank 5 Protection Bit Register Bank Protect 5 
B6 Bank 6 Protection Bit Register Bank Protect 6 
B7 Bank 7 Protection Bit Register Bank Protect 7 
B8 Bank 8 Protection Bit Register Bank Protect 8 
B9 Bank 9 Protection Bit Register Bank Protect 9 
B10 Bank 10 Protection Bit ~ Register Bank Protect 10 
Bit Bank 11 Protection Bit Register Bank Protect 11 
B12 Bank 12 Protection Bit Register Bank Protect 12 
B13 Bank 13 Protection Bit Register Bank Protect 13 
B14 Bank 14 Protection Bit Register Bank Protect 14 
B15 Bank 15 Protection Bit Register Bank Protect 15 
BP Byte Pointer ALU Status 6-5 
Byte Pointer 1-0 
C Carry ALU Status 7 
CHA Channel! Address Channel Address 31-0 
CHD Channel Data Channel Data 31-0 
CNTL Control Channel Control 30-24 
CR Load/Store Count Remaining Channel Control 23-16 
Load/Store Count Remaining 7-0 
CV Contents Valid Channel Control 0 
DA Disable All Interrupts and Traps Current Processor Status 0 
. Old Processor Status 0 
DF Divide Flag ALU Status 1 
Di Disable Interrupts Current Processor Status 1 
| _ Old Processor Status 1 
DM Floating-Point Divide By Zero Mask Floating-Point Environment 5 
DO Integer Division Overflow Mask Integer Environment 1 
DS Floating-Point Divide By Zero Sticky Floating-Point Status 5 
DT Floating-Point Divide By Zero Trap ALU Status 13 
FF Fast Floating-Point Select Floating-Point Environment 8 
FC Funnel Shift Count ALU Status 4-0 
Funnel Shift Count 4-0 
FRM Floating-Point Round Mode Floating-Point Environment 7-6 
FZ Freeze Current Processor Status 10 
Old Processor Status 10 
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Table B-1 Register Field Summary (continued) 
Label Field Name Register Bit 
IE Interrupt Enable Timer Reload 24 
IM Interrupt Mask Old Processor Status 3-2 
Current Processor Status 3-2 
IN Interrupt Timer Reload 25 
IP Interrupt Pending Current Processor Status 14 
Old Processor Status 14 
IPA Indirect Pointer A Indirect Pointer A 9-2 
IPB Indirect Pointer B Indirect Pointer B 9-2 
IPC Indirect Pointer C Indirect Pointer C 9-2 
LS Load/Store Channel Control 15 
ML Multiple Operation Channel Control 14 
MO Integer Multiplication Overflow Mask Integer Environment 0 
N Negative | ALU Status 9 
NM Floating-Point Invalid Operation Mask Floating-Point Environment 0 
NN Not Needed Channel Control 1 
NS Floating-Point Invalid Operation Sticky Floating-Point Status 0 
NT Floating-Point Invalid Operation Trap Floating-Point Status 8 
OV Overflow Timer Reload | 26 
PCO Program Counter 0 Program Counter 0 31-2 
PC1 Program Counter 1 Program Counter 1 31-2 
PC2 Program Counter 2 Program Counter 2 31-2 
PRL Processor Release Level Configuration 31-24 
Q Quotient/Multiplier | Q Register 31-0 
RM Floating-Point Reserved Operand Mask Floating-Point Environment 1 
RS Floating-Point Reserved Operand Sticky Floating-Point Status 1 
RT Floating-Point Reserved Operand Trap Floating-Point Status 9 
SM Supervisor Mode Current Processor Status 4 
Old Processor Status 4 
ST Set Channel Control 13 
TCV Timer Count Value Timer Counter 23-0 
TD Timer Disable Current Processor Status 17 
Old Processor Status 17 
TE Trace Enable Current Processor Status 13 
Old Processor Status 13 
B-8 
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Table B-1 Register Field Summary (continued) 
Label Field Name Register Bit 
TP Trace Pending Current Processor Status 12 
Old Processor Status 12 
TR Target Register Channel Control . 9~2 
TRV Timer Reload Value Timer. Reload 23-0 
TU Trap Unaligned Access Current Processor Status 11 
; Old Processor Status 11 
UM Floating-Point Underflow Mask Floating-Point Environment 3 
US Floating-Point Underflow Sticky Floating-Point Status 3 
UT Floating-Point Underflow Trap Floating-Point Status 11 
V Overflow ALU Status 10 
VAB Vector Area Base Vector Area Base Address 31-10 
VM Floating-Point Overflow Mask Floating-Point Environment 2 
vs Floating-Point Overflow Sticky Floating-Point Status | 2 
VT Floating-Point Overflow Trap Floating-Point Status 10 
WM Wait Mode Current Processor Status 7 
Old Processor Status 7 
XM Floating-Point Inexact Result Mask Floating-Point Environment 4 
XS Floating-Point inexact Result Sticky Floating-Point Status 4 
XT Floating-Point Inexact Result Trap Floating-Point Status 12 
Zz Zero ALU Status 8 
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Figure C-1 On-Chip Peripheral Registers 


Address 
(hexadecimal) 













31 15 7 0 
80000000 
BSTO LM'res BS BST2 BST3 
BWE 
ROM Control Register (RMCT) 
Page 8-1 
31 23 15 7 0 
ee Cee ae ka 
ASELO |AMASKO} ~—=ASEL1 AMASK1 ASEL2  JAMASK2}. | ASELS 
ROM Configuration Register (RMCF) 
Page 8-3 
31 23 | 15 7 0 
cone ELL Lest | wel | fice] | me | ner 
_ REFRATE 
PGO! res'PG1! = PG2!— ss PGS: SC 
DWO LM DW1 DW2 DW3 
DRAM Control Register (DRCT) oie 
Page 9-1 


31 23 15) 7 0 
8000000C | 
| ASELO | |AMASKO ASEL1 AMASK‘1 ASEL2 AMASK2 ASEL3 AMASK3 


DRAM Configuration Register (DRCF) 


Page 9-3 : 

' 31 23 15 7 0 

| VALI 
DRAM Mapping Register 0 (DRMO) 
Page 9-3 
31 23 15 7 9) 
VALID | | 

DRAM Mapping Register 1 (DRM1) 
Page 9-4 . 


= Reserved on Am29205 microcontroller 
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Figure C-{ On-Chip Peripheral Registers (continued) 


Address 
(hexadecimal) 


80000018 


8000001C 


80000020 


80000024 


80000028 


80000030 


80000034 


= 





VALID 
DRAM Mapping Register 2 (DRM2) 
Page 9-4 
VALID 
DRAM Mapping Register 3 (DRM3) 
Page 9-4 
31 23 15 : : 
rs] rowaro | Joos] town 
lOEXTO lOEXT1 lOEXT2 lOEXTS 
PIA Control Register 0 (PICTO) 
Page 10-1 
31 B83 15 7 0 





lIOEXT4 {OEXTS 





PIA Control Register 1 (PICT1) 

Page 10-1 

mallet ee 

; VDI res , PPI RXSI'TXDI 

Interrupt Control Register (ICT) DMAO! * res RXDI 

Page 16-24 -DMAII 

31 | 23 15 7 . 0 
vu | ones LL LL dee | 

DMAEXT . acs | UD! EN'CTE: Tt: 

DMAO Control Register (DMCTO) By: Te OE) oN 

Page 11-2 - 

31 


23 15 7 0 
DRAMADDR- 





DMAO Address Register (DMADO) 
Page 11-4 
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Figure C-1 On-Chip Peripheral Registers (continued) 


Address 
(hexadecimal) 


80000070 31 23 15 . 7 0 


reserved DRAMADDR 
80000036 


(alternate) DMAO Address Tail Register (TADO) 
(See Section 7.4) Page 11-4 


31 23 15 7 


0 
reserved DMACNT 


DMAO Count Register (DMCNO) 
Page 11-5 


g000003Cc = 31 23 15 7 0 


8000003A | reserved DMACNT 


(alternate) DMAO Count Tail Register (TCNO) 
(See Section 7.4) Page 11-5 





31 23 15 7 0 

sooocoe | | | ouawar fowlooml | resowes LLL I | ws LL) 
DMAEXT ACS UD! EN !CTE TT 
DMA1 Control Register (DMCT1) es oe 
Page 11-5 . 


DMAi Address Register (DMAD1) 
Page 11-7. 


31 23 15 7 0 


reserved DMACNT 


31 23 15 7 0 


DMA1 Count Register (DMCN1) 





Page 11-7 
31 23 15 7 0 
soneneo | ered | | film lovccsl fas) meet ford nam _| fe 
; reserved WLGN reserved HH —sreserved RMODE 
‘BRK: STP | | RSIE 
LOOP DSR 
Serial Port Control Register (SPCT) 


Page 14-1 
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Figure C-1 On-Chip Peripheral Registers (continued) 


Address 
(hexadecimal) 


31 23 15 7 0 
es Ne 





0000084 
Serial Port Status Register (SPST) THRE § —>s- ‘BRKI PER: 
Page 14-3 TEMT RDR DTR FER OER 
31 23 15 7 0 


Serial Port Transmit Holding Register (SPTH) 
Page 14-4 


31 23 15 7 0 


Serial Port Receive Buffer Register (SPRB) 
Page 14-4 | 


31 23 15 7 0 


Baud Rate Divisor Register (BAUD) 





Page 14-5 | 
31 23 15 7 0 
ese El 
: reserved TDELAY 
ress | | DRa! DDIR 'FACK — BRS: AFD 
FWT TRA ; FBUSY DHH ARB 
Parallel Port Control Register (PPCT) 
Page 13-1 
7 0 
800000C8 oh 2s T Ue 
(preferred) 
reserved TDELAYV reserved BCT ~ reserved 
800000C1 : 1? 
(alternate) STB 1 ACK 
(See Section 7.4) Parallel Port Status Register (PPST) BSY 
Page 13-3 
31 23 15 7 0 
8 bits 
80000004 . Am29200 and Am29205 microcontrollers 
31 23 15 7 0 
Parallel Port Data Register (PPDT) | 832 bits 
Page 13-4 Am29200 microcontroller only 
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Figure C-1 On-Chip Peripheral Registers (continued) 


Address 
(hexadecimal) 











800000D0 
PIO Control Register (POCT) 
Page 12-1 
31 23 15 7 0 
800000D4 | 
PIO Input Register (PIN) 
Page 12-2 
31 23 15 7 0 
reserved 
PIO Output Register (POUT) 
Page 12-2 





reserved . 


PIO Output Enable Register (POEN) 





Page 12-3 
— ee Hee el FIT 

Video Control aici (VCT) DDIR CLKI:PSIO! PSL!SDIR! 
Page 15-1 res PSI LSI VIDI 

31 23 © 15 7 0 
Top Margin Register (T sal 
Page 15-3 | 
31 23 15 | 7 0 


a LEFTCNT _LINECNT 


Side Margin Register (SIDE) 
Page 15-3 


31 23 5 7 0 
800000EC 


VDATA 


Video Data Holding Register (VDT) 
Page 15-4 
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Table C-1 


C-6 


Peripheral Register Field Summary 


Label 
ACK 
ACS 


AFD 
AMASKO 


AMASK1 
AMASK2 
AMASK3 


ARB 
ASELO 


ASEL1 
ASEL2 
ASEL3 


BAUDDIV 
BCT 
BRK 
BRKI 
BRS 


_ BSTO 


BST1 
BST2 
BST3 
BSY 
BWE 


~ CLKDIV 


CLKI 
CTE 


CTl 


Field Name 


PACK Level 


Assert Chip Select 


Autofeed 


Address Mask, Bank 0 
Address Mask, Bank 1 
Address Mask, Bank 2 
Address Mask, Bank 3 


ACK Relationship to BUSY 


Address Select, Bank 0 
Address Select, Bank 1 
Address Select, Bank 2 
Address Select, Bank 3 


Baud Rate Divisor 

Byte Count » 

Send Break 

Break Interrupt 

BUSY Relationship to STROBE 
Burst-Mode ROM, Bank 0 
Burst-Mode ROM, Bank 1 
Burst-Mode ROM, Bank 2 
Burst-Mode ROM, Bank 3 
PBUSY Level | 

Byte Write Enable 

Clock Divide 

Clock Invert 


Count Terminate Enable 


Count Terminate Interrupt 


Register 


Parallel Port Status 


DMAO Control . 
DMA1 Control 


Parallel Port Control 


ROM Configuration 
DRAM Configuration 


ROM Configuration 
DRAM Configuration 


ROM Configuration 
DRAM Configuration 


ROM Configuration 
DRAM Configuration 


Parallel Port Control 


ROM Configuration 
DRAM Configuration 


ROM Configuration 
DRAM Configuration 


ROM Configuration 
DRAM Configuration 


ROM Configuration 
DRAM Configuration 


Baud Rate Divisor 
Parallel Port Status 
Serial Port Control 
Serial Port Status 
Parallel Port Control 
ROM Control 

ROM Control 

ROM Control 

ROM Control | 
Parallel Port Status 
ROM Control 
Video Control 


Video Control — 


- DMAO Control 


DMA1 Control 


DMAO Control 
DMAi1 Control 


Peripheral Register Summary 


Bit 


19 
19 


26-24 
26-24 


18-16 
18-16 


10-8 
10-8 


2-0 
2-0 


31-27 
31-27 


23-19 
23-19 


15-11 
15—11 


7-3 
7-3 


15-0 


25. 


31 


23 
15 


27 
14-11 


oo ao 


AMD &\ 


Table C-1 Peripheral Register Field Summary (continued) 


Label Field Name Register Bit 
DDIR Data Direction Parallel Port Control 10 
Video Control 10 
DHH Disable Hardware Handshake Parallel Port Control 5 
DMAOI DMA Channel 0 Interrupt _ ° Interrupt Control 14 
DMAI1! DMA Channel 1 Interrupt Interrupt Control 13 
DMACNT DMA Count DMAO Count 23-0 
DMAO Count Tail 23-0 
DMA1 Count 23-0 
DMAEXT DMA Extend i DMAO Control 31 
| DMA1 Control 31 
DMAWAIT DMA Wait States DMAO Control 28-24 
DMAt1 Control | 28-24 
DRAMADDR-_ DRAM Address DMAO Address 23-0 
DMAO Address Tail 23-0 
DMA1 Address_~ 23-0 
DRM DMA Request Mode DMAO Control 21-20 
DMA1 Control 21-20 
DRQ Data Request Parallel Port Control — 15 
Video Control 15 
DSR Data Set Ready Serial Port Control | 24 
DTR Data Terminal Ready a Serial Port Status | 4 
DW Data Width DMAO Control | 22-23 
DMA1 Control 22-23 
Dwo Data Width, Bank 0 ROM Control — 30-29 
DRAM Control 30 
Dwi Data Width, Bank 1 ROM Control | 22-21 
DRAM Control 26 
Dw2 Data Width, Bank 2 ROM Contro! | 14-13 
| DRAM Control | 22 
DW3 Data Width, Bank3 ROM Control 6-5 
| DRAM Control _ | 18 
EN Enable DMAO Control 7 
| DMAt1 Control 7 
FACK Force ACK Parallel Port Control _ , 6 
FBUSY Force Busy Parallel Port Control 7 
FER Framing Error Serial Port Status 2 
FWT Full Word Transfer | Parallel Port Control 30 


INVERT PIO Inversion PIO Control 15-0 





Peripheral Register Summary C-7 


oA amp 
Table C-1 


Peripheral Register Field Summary (continued) 


Label 


IOEXTO 
IOEXT1 
\OEXT2 
|OEXT3 
|OEXT4 
IOEXTS 
lOPI 
lIOWAITO 
l|OWAIT1 
lOWAIT2 
IOWAIT3 
lOWAIT4 
IOWAITS 
IRM8 
IRM9 
IRM10 
IRM11 
IRM12 
IRM13 
IRM14 
IRM15 
LEFTCNT 
LINECNT 
LM 
LOOP 
LSI 


MODE 


OER 
PDATA 


PER 
PERADDR 


Field Name 


Input/Output Extend, Region 0 
Input/Output Extend, Region 1 
Input/Output Extend, Region 2 
Input/Output Extend, Region 3 
Input/Output Extend, Region 4 
Input/Output Extend, Region 5 

I/O Port Interrupt 

Input/Output Wait States, Region 0 
Input/Output Wait States, Region 1 
Input/Output Wait States, Region 2 
Input/Output Wait States, Region 3 
Input/Output Wait States, Region 4 
Input/Output Wait States, Region 5 
Interrupt Request Mode, PIO8 
Interrupt Request Mode, PIO9 
Interrupt Request Mode, PIO10 
Interrupt Request Mode, PIO11 
Interrupt Request Mode, Pi012 
Interrupt Request Mode, PIO13 
interrupt Request Mode, PIiO14 
Interrupt Request Mode, P!015 
Left Margin Count 

Line Count 


Large Memory 


Loopback 
Line Sync Invert 


Parallel Port Mode 
Video Interface Mode 


Overrun Error 


Parallel Port Data 


Parity Error 


Peripheral Address 


Register 


PIA Control 0 
PIA Control! 0 
PIA Control 0 
PIA Control 0 
PIA Control 1 
PIA Control 1 
Interrupt Control 
PIA Control 0 
PIA Control 0 
PIA Control 0 
PIA Control 0° 
PIA Control 1 
PIA Control 1 
P!O Control 
PIO Contro! 
PIO Contro! 
PiO Control 
PIO Control 
PIO Control 
PIO Control 
PIO Control! 
Side Margin 
Side Margin 


ROM Contro! 
DRAM Control 


Serial Port Control 
Video Control 


Parallel Port Control 
Video Control 


Serial Port Status . 


Parallel Port Data 


Serial Port Status 


DMAO Address 
DMA1 Address 


Peripheral Register Summary 


Bit 
31 


23 
15 


31 
23 
23-16 
28-24 
20-16 
12-8 


28-24 
20-16 
17-16 
19-18 
21-20 
23-22 
25-24 
27-26 
29-28 
31-30 
27-16 
15-0 


28 
28 


26 


7-0 
31-0 


31-24 
31-24 
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Table C-1 Peripheral Register Field Summary (continued) 

Label Field Name Register Bit 
PGO Page-Mode DRAM, Bank 0 DRAM Control 31 
PG1 Page-Mode DRAM, Bank 1 DRAM Control 27 
PG2 Page-Mode DRAM, Bank 2 DRAM Control 23 
PG3 Page-Mode DRAM, Bank 3 _ DRAM Control 19 
PHYBASE Physical Base Address DRAM Mapping 0 7-0 

DRAM Mapping 1 7-0 

DRAM Mapping 2 7-0 

DRAM Mapping 3 7-0 
PIN PIO Input PIO Input 15-0 
PMODE Parity Mode Serial Port Control 21-19. 
POEN PIO Output Enable PIO Output Enable 15-0 
POUT PIO Output PIO Output — 15-0 
PPI Parallel Port Interrupt Interrupt Control . 11 
PSI Page Sync Invert Video Control 4 
PSIO Page Sync Input/Output | Video Control 5 
PSL Page Sync Level Video Control 3 
QEN Queue Enable DMAO Control 4 
RDATA Receive Data Serial Port Receive Buffer 7-0 
RDR Receive Data Ready Serial Port Status 8 
REFRATE Refresh Rate DRAM Control 8-0 
RMODE Receive Mode Serial Port Control 1-0 
RSIE Receive Status Interrupt Enable Serial Port Control 2 
RW Read/Write DMAO Control 8 

- DMA1 Control 8 
RXDI Serial Port Receive Data Interrupt Interrupt Control 6 
RXS! Serial Port Receive Status Interrupt Interrupt Control 7 
Sc: Static-Column DRAM DRAM Control 15 
SDIR Shift Direction Video Control 1 
STB PSTROBE Level | Parallel Port Status 31 
STP Stop Bits Serial Port Control 18 
TDATA Transmit Data Serial Port Transmit Holding 7-0 
TDELAY Transfer Delay Parallel Port Control 23-16 
TDELAYV TDELAY Counter Value Parallel Port Status 23-16 
TEMT Transmitter Empty Serial Port Status _ 10 
THRE Transmit Holding Register Empty Serial Port Status 9 
Peripheral Register Summary C-9 
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‘Table C-1 Peripheral Register Field Summary (continued) 
Label Field Name Register Bit 
TMODE Transmit Mode Serial Port Control. 9-8 
TOPCNT Top Margin Count Top Margin 11-0 
TRA Transfer Active Parallel Port Control 14 
TTE TDMA Terminate Enable DMAO Control 6 
DMA1 Control 6 
TTI TDMA Terminate Interrupt DMAO Control 1 
DMAt1 Control 1 
TXDI Serial Port Transmit Data Interrupt Interrupt Control 5 
UD Transfer Up/Down DMAO Control 9 
DMA‘ Control 9 
VALID Valid Mapping DRAM Mapping 0 31 
‘DRAM Mapping 1 31 
DRAM Mapping 2 31 
_ DRAM Mapping 3 31 
VIDI Video Invert Video Control 0 
VDATA Video Data Video Data Holding 31-0 
VDI Video Interrupt Interrupt Control 27 
VIRTBASE Virtual Base Address DRAM Mapping 0 23-16 
DRAM Mapping 1 23-16 © 
DRAM Mapping 2 23-16 
DRAM Mapping 3 23-16 
WLGN Word Length Serial Port Control 17-16 
WSO Wait States, Bank 0 ROM Control 25-24 
WS1 Wait States, Bank 1 ROM Control 17-16 
ws2 Wait States, Bank 2 ROM Control _ 9-8 
WS3 Wait States, Bank 3 ROM Control 1-0 
C-10 Peripheral Register Summary 


INDEX 


A 


A23-—A0 signals 
definition, 7-1 
external DMA transfers, 11-4 


absolute-register number, 2-10 

access priority, 7-7—7-8 

ACK bit (PACK Level), 13-4 

ACS bit (Assert Chip Select), 11-3, 11-6 


activation records 
allocation, 4-1-4-2, 4-4 
definition, 4-1 
ADD (Add) instruction, description, 18-8 


‘ Add Wait States signal. See WAIT signal; 
WAIT/TRIST signal 


ADDC (Add with Carry) instruction, description, 18-9 


ADDCS (Add with Carry, Signed) instruction, 
description, 18-10 


ADDCU (Add with Carry, Unsigned) instruction, 
description, 18-11 








addition instructions 
ADD (Add), 18-8 
ADDC (Add with Carry), 18-9 
ADDCS (Add with Carry, Signed), 18-10 
ADDCU (Add with Carry, Unsigned), 18-11 
ADDS (Add, Signed), 18-12 
ADDU (Add, Unsigned), 18-13 
DADD (Floating-Point Add, Double-Precision), 
18-47 
FADD (Floating-Point Add, Single-Precision), 18-65 


Address Bus signals. See A23—A0 signals 


addressing 
byte and half-word addressing, 3-11-3-12 
indirect register addressing, 2-12-2-14 
internal peripheral address assignments, 7-8—7-10 
registers, 2-10 : 


ADDS (Add, Signed) instruction, description, 18-12 — 
ADDU (Add, Unsigned) instruction, description, 18-13 
AFD bit (Autofeed), 13-3 


alignment 
of bytes within words, 3-4 
of instructions, 3-13 
of words and half-words, 3-13 
Unaligned Access trap, 16-2 


pa 


ALU Status Register 
arithmetic instructions, 2-1 
description, 2-16—2-17 
logical instructions, 2-4 


Am29200 microcontroller family 
development tools, 1-7 
overview, xv, 1-1 
product support, 1-7 F 


Am29200 microcontroller 
block diagram, 1-3 
design philosophy, xv—xvii 
distinctive characteristics, 1-2~—1-3 
overview 
burst-mode memory support, 1-8 
bus-compatibility, 1-7 
data formats, 1-8—-1-9 
debugging and testing, 1-9-1-10 
DRAM mapping, 1-9 
instruction set, 1-8 
instruction timing, 1-7-1-8 
interfaces, 1-6 
interrupts and traps, 1-9-1-10 
page-mode memory support, 1-8 
peripherals on-chip, 1-5-1-6 
pipelining, 1-8 
price/performance, 1-6 
protection, 1-9 
software-compatibility, 1-7 
performance overview, 1-7—1-9 
product support, ii 
special settings, A-1 


Am29205 microcontroller 


_ block diagram, 1-4 


design philosophy, xv—xvii 
distinctive characteristics, 1-4—1-5 
emulating the Am29205 microcontroller, 17-17. 
overview 
bus-compatibility, 1-7 
data formats, 1-8—-1-9 
debugging and testing, 1-9—1-10 
DRAM mapping, 1-9 
instruction set, 1-8 
instruction timing, 1-7—1-8 
interfaces, 1-6 
interrupts and traps, 1-9—1-10 
page-mode memory support, 1-8 
peripherals on-chip, 1-5-1-6 
pipelining, 1-8 
price/performance, 1-6 


Index | | l-1 
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- protection, 1-9 
software-compatibility, 1-7 
performance overview, 1-7—-1-9 

pin changes, 7-7 
product support, iii 
special settings, A-1 


AMASKO field (Address Mask, Bank 0) 
~ DRAM Configuration Register, 9-3 
ROM Configuration Register, 8-2-8-3 


AMASK‘1 field (Address Mask, Bank 1) 
DRAM Configuration Register, 9-3 
ROM Configuration Register, 8-3 


AMASK2 field (Address Mask, Bank 2) 
DRAM Configuration Register, 9-3 
ROM Configuration Register, 8-3 


AMASKS field (Address Mask, Bank 3) 
Am29205 microcontroller, 8-3 
DRAM Configuration Register, 9-3 
ROM Configuration Register, 8-3 


AND (AND logical) instruction, description, 18-14 


_ ANDN (AND-NOT logical) instruction, description, 
18-15 


ARB bit (ACK Relationship to BUSY), 13-3 
argcount value, 4-15 | 
argument passing, 4-7-4-8 


arithmetic instructions 
See also specific types of arithmetic instructions 
ALU Status Register, 2-1 
multiprecision integer operations, 2-25 
overview, 2-1—2-3 
Status results, 2-17 
table, 2-2 
~ trapping, 2-26 
virtual arithmetic processor, 2-26 


ASELO field (Address Select, Bank 0) 
DRAM Configuration Register, 9-3 
ROM Configuration Register, 8-2 


ASEL1 field (Address Select, Bank 1) 
DRAM Configuration Register, 9-3 
ROM Configuration Register, 8-3 


ASEL2 field (Address Select, Bank 2) 
DRAM Configuration Register, 9-3 
ROM Configuration Register, 8-3 


ASELS field (Address Select, Bank 3) 
Am29205 microcontroller, 8-3 
DRAM Configuration Register, 9-3 
required setting for Am29205 microcontroller, A-1 
‘ROM Configuration Register, 8-3 


ASEQ (Assert Equal To) instruction 
description, 18-16 
NO-OPs, 2-25-—2-26 


ASGE (Assert Greater Than or Equal ne instruction, 
description, 18-17 


ASGEU (Assert Greater Than or Equal To, Unsigned) 
instruction, description, 18-18 


ASGT (Assert Greater Than) instruction, sissenpiion: 
18-19 


ASGTU (Assert Greater Than, Unsigned) instruction, 
description, 18-20 


ASLE (Assert Less Than or Equal To) instruction, 
description, 18-21 


ASLEU (Assert Less Than or Equal To, Unsigned) 
instruction, description, 18-22 


ASLT (Assert Less Than) instruction, description, © 
18-23 


_ ASLTU (Assert Less Than, Unsigned) instruction, 


description, 18-24 


ASNEQ (Assert Not Equal To) instruction 
description, 18-25 
operating system calls, 2-24 


assert instructions 
run-time checking, 2-24 
setting instruction breakpoints, 17-2 
simulating interrupts and traps, 16-13-16-14 
trapping, 2-24 


B15-B0 field (Bank 15—Bank 0 Protection Bits), 6-3 
Baud Rate Divisor Register, description, 14-5 
BAUDDIV field (Baud Rate Divisor), 14-5 

BCT field (Byte Count), 13-3-—13-4 

big endian, 3-1, 3-2, 3-12 — 


bit strings 
Funnel Shift Count Register, 3-3-3-4 
overview, 3-3-3-4 


bits 


ACK (PACK Level), 13-4 

ACS (Assert Chip Select), 11-3, 11-6 

AFD (Autofeed), 13-3 

AMASKO (Address Mask, Bank 0), 8-2-8-3, 9-3 
AMASK1 (Address Mask, Bank 1), 8-3, 9-3 
AMASK2 (Address Mask, Bank 2), 8-3, 9-3 
AMASKS (Address Mask, Bank 3), 8-3, 9-3 
ARB (ACK Relationship to BUSY), 13-3 
ASELO (Address Select, Bank 0), 8-2, 9-3 
ASEL1 (Address Select, Bank 1), 8-3, 9-3 . 
ASEL2 (Address Select, Bank 2), 8-3, 9-3 
ASEL3 (Address Select, Bank 3), 8-3, 9-3 
B15-Bo (Bank 15—Bank 0 Protection Bits), 6-3 
BAUDDIV (Baud Rate Divisor), 14-5 

BCT (Byte Count), 13-3-13-4 

BP (Byte Pointer), 2-16-2-17, 3-3 





BRK (Send Break), 14-1 

BRKI (Break Interrupt), 14-4 

BRS (BUSY Relationship to STROBE), 13-3 
BSTO (Burst-Mode ROM, Bank 0), 8-1 

BST1 (Burst-Mode ROM, Bank 1), 8-2 

BST2 (Burst-Mode ROM, Bank 2), 8-2 

BST3 (Burst-Mode ROM, Bank 3), 8-2 

BSY (PBUSY Level), 13-4 | 

BWE (Byte Write Enable), 8-2 

C (Carry), 2-16 

CHA (Channel Address), 16-18 

CHD (Channel Data), 16-18 

CLKDIV (Clock Divide), 15-1 

CLKI (Clock Invert), 15-2 

CR (Load/Store Count Remaining), 3-11, 16-19 
CTE (Count Terminate Enable), 11-3, 11-7 
CT! (Count Terminate Interrupt), 11-4, 11-7 
CV (Contents Valid), 16-19 

DA (Disable All Interrupts and Traps), 16-3 
DDIR (Data Direction), 13-2, 15-2 : 

DF (Divide Flag), 2-16 

DHH (Disable Hardware Handshake), 13-2—13-3 
DI (Disable Interrupts), 16-3 

DM (Floating-Point Divide-By-Zero Mask), 2-15 
DMAOI (DMA Channel 0 Interrupt), 16-24 
DMA1I (DMA Channel 1 Interrupt), 16-24 
DMACNT (DMA Count), 11-5 

DMAEXT (DMA Extend), 11-2, 11-6 
DMAWAIT (DMA Wait States), 11-2, 11-6 

DO (Integer Division Overflow Mask), 2-15 
DRAMADDR (DRAM Address), 11-4 

DRM (DMA Request Mode), 11-2, 11-6 

DRQ (Data Request), 13-2, 15-1 

DS (Floating-Point Divide-By-Zero Sticky), 2-19 
DSR (Data Set Ready), 14-1 

DT (Floating-Point Divide-By-Zero Trap), 2-18-2-19 
DTR (Data Terminal Ready), 14-4 

DW (Data Width), 11-2, 11-6 

DW0 (Data Width, Bank 0), 8-2, 9-2 

DW1 (Data Width, Bank 1), 8-2, 9-2 

DW2 (Data Width, Bank 2), 8-2, 9-2 

DW3 (Data Width, Bank 3), 8-2, 9-2 

EN (Enable), 11-3, 11-7 

FACK (Force ACK), 13-2 

FBUSY (Force Busy), 13-2 

FC (Funnel Shift Count), 2-17, 3-3—3-4 

FER (Framing Error), 14-4 

FF (Fast Float Select), 2-14 

FRM (Floating-Point Round Mode), 2-14 

FWT (Full Word Transfer), 13-1 

FZ (Freeze), 16-2 

I, 3-8 

JE (Interrupt Enable), 16-23 

IM (interrupt Mask), 16-3 

IN (Interrupt), 16-23 

INVERT (PIO Inversion), 12-2 

l[OEXTO (Input/Output Extend, Region 0), 10-2 
IOEXT1 (Input/Output Extend, Region 1), 10-2 
IOEXT2 (Input/Output Extend, Region 2), 10-2 
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IOEXT3 (Input/Output Extend, Region 3), 10-2 

IOEXT4 (Input/Output Extend, Region 4), 10-2 

IOEXTS5 (Input/Output Extend, Region 5), 10-2 

IOPI (I/O Port Interrupt), 16-24 

lIOWAITO (Input/Output Wait States, Region 0), 10-2 

IOWAIT1 (Input/Output Wait States, Region 1), 10-2 

lLOWAIT2 (Input/Output Wait States, Region 2), 10-2 

IOWAITS (Input/Output Wait States, Region 3), 10-2 

IOWAIT4 (Input/Output Wait States, Region 4), 10-2 

IOWAITS5 (Input/Output Wait States, Region 5), 10-2 

IP (Interrupt Pending), 16-2 

IPA (Indirect Pointer A), 2-13 

IPB (Indirect Pointer B), 2-14 

IPC (Indirect Pointer C), 2-13 

1RM14—IRMB8, 12-2 

IRM15 (Interrupt Request Mode, PIO15), 12-1-12-2 

LEFTCNT (Left Margin Count), 15-3 

LINECNT (Line Count), 15-3 

LM (Large Memory), 8-2, 9-2 

LOOP (Loopback), 14-1 

LS (Load/Store), 16-19 

LSI (Line Sync Invert), 15-2 

ML (Multiple Operation), 16-19 

MO (Integer Multiplication Overflow Exception 
Mask), 2-16 

MODE (Parallel Port Mode), 13-2 

MODE (Video Interface Mode), 15-2 

N (Negative), 2-16 

NM (Floating-Point Invalid Operation Mask), 2-15 

NN (Not Needed), 16-19 

NS (Floating-Point Invalid Operation Sticky), 2-19 

NT (Floating-Point Invalid Operation Trap), 2-19 

OER (Overrun Error), 14-4 

OPT (Option), 3-8 

OV (Overflow), 16-23 

PCO (Program Counter 0), 16-9 

PC1 (Program Counter 1), 16-9 

PC2 (Program Counter 2), 16-10 

PDATA (Parallel Port Data), 13-4 

PER (Parity Error), 14-4 

PERADDR (Peripheral Address), 11-4 

peripheral registers (table), C-6—C-11 

PGO (Page-Mode DRAM, Bank 0), 9-2 

PG1 (Page-Mode DRAM, Bank 1), 9-2 

PG2 (Page-Mode DRAM, Bank 2), 9-2 

PG3 (Page-Mode DRAM, Bank 3), 9-2 

PHYBASE (Physical Base Address), 9-4 

PIN (PIO Input), 12-2 

PMODE (Parity Mode), 14-2 

POEN (PIO Output Enable), 12-3 

POUT (PIO Output), 12-3 

PPI (Parallel Port Interrupt), 16-24 

PRL (Processor Release Level), 2-27 

processor registers (table), B-7—B-9 

PSI (Page Sync Invert), 15-2 

PSIO (Page Sync Input/Output), 15-2 

PSL (Page Sync Level), 15-2 

Q (Quotient/Multiplier), 2-20 

QEN (Queue Enable), 11-3 
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RA, 3-8 
RB, 3-8 
RDATA (Receive Data), 14-5 
RDR (Receive Data Ready), 14-3 
REFRATE (Refresh Rate), 9-2 
reserved fields, A-1 | 
RM (Floating-Point Invalid Operand Mask), 2-15 
RMODE (Receive Mode), 14-3 
RS (Floating-Point Reserved Operand Sticky), 2-19 
RSIE (Receive Status Interrupt Enable), 14-2 
RT (Floating-Point Reserved Operand Trap), 2-19 
RW (Read/Write), 11-3, 11-7 
RXDI (Serial Port Receive Data Interrupt), 16-24 
RXS! (Serial Port Receive Status Interrupt), 16-24 
SB (Set Byte Pointer/Sign Bit), 3-8 
SC (Static-Column DRAM), 9-2 
SDIR (Shift Direction), 15-2 
SM (Supervisor Mode), 16-3 
ST (Set), 16-19 . 
STB (PSTROBE Level), 13-3 
STP (Stop Bits), 14-2 
TCV (Timer Count Value), 16-22—16-23 
TD (Timer Disable), 16-2 
TDATA (Transmit Data), 14-4 
_ TDELAY (Transfer Delay), 13-2 
TDELAYV (TDELAY Counter Value), 13-3 
TE (Trace Enable), 16-2 
TEMT (Transmitter Empty), 14-3 
THRE (Transmit Holding Register Empty), 14-3 
TMODE (Transmit Mode), 14-2 
TOPCNT (Top Margin Count), 15-3 
TP (Trace Pending), 16-2 
TR (Target Register), 16-19 
TRA (Transfer Active), 13-2 
TRV (Timer Reload Value), 16-23 
TTE (TDMA Terminate Enable), 11-3, 11-7 | 
‘TTI (TDMA Terminate Interrupt), 11-3, 11-7 _ 
TU (Trap Unaligned Access), 16-2 
TXDI (Serial Port Transmit Data Interrupt), 16-24 
UD (Transfer Up/Down), 11-3, 11-6—11-7 
UM (Floating-Point Underflow Mask), 2-15 
US (Floating-Point Underflow Sticky), 2-19 
UT (Floating-Point Underflow Trap), 2-19 
V (Overflow), 2-16 
VAB (Vector Area Base), 16-5 
VALID (Valid Mapping), 9-3 
‘VDATA (Video Data), 15-4 
VDI (Video Interrupt), 16-24 
VIDI (Video Invert), 15-3 
VIRTBASE (Virtual Base Address), 9-3 
VM (Floating-Point Overflow Mask), 2-15 
VS (Floating-Point Overflow Sticky), 2-19 
VT (Floating-Point Overflow Trap), 2-19 
WLGN (Word Length), 14-2 
WM (Wait Mode), 16-3 
WSO (Wait States, Bank 0), 8-2 
WS1 (Wait States, Bank 1), 8-2 
WS2 (Wait States, Bank 2), 8-2 
WS3 (Wait States, Bank 3), 8-2 
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XM (Floating-Point Inexact Result Mask), 2-15 
_XS (Floating-Point Inexact Result Sticky), 2-19 
XT (Floating-Point Inexact Result Trap), 2-19 

Z (Zero), 2-16 


Boolean data, 3-5. 


BOOTW signal 
definition, 7-3 
setting width of boot ROM, 8-3 


boundary-scan cells 
bypass scan path, 17-8 
description, 17-4-—17-5 
ICTEST1 scan path, 17-10 
ICTEST2 scan path, 17-10—-17-11 
instruction scan path, 17-8 
main data scan path, 17-8-17-10 


Boundary-Scan Register (BSR), 17-4-—17-5 
CNTL field, 17-3—17-4 


BP field (Byte Pointer) 
ALU Status Register, 2-16—2-17 | 
Byte Pointer Register, 3-3 


branch instructions 
CALL (Call Subroutine), 18-26 
CALLI (Call Subroutine, Indirect), 18-27 
JMP (Jump), 18-79 
JMPF (Jump False), 18-80 
JMPFDEC (Jump False and Decrement), 18-81 
JMPFI (Jump False Indirect), 18-82 , 
JMPI (Jump Indirect), 18-83 
JMPT (Jump True), 18-84 
JMPTI (Jump True Indirect), 18-85 
overview, 2-7 | 
table, 2-7 


breakpoints 
using assert instructions, 17-2 
using the HALT instruction, 17-2 


BRK bit (Send Break), 14-1 

BRKI bit (Break Interrupt), 14-4 

BRS bit (BUSY Relationship to STROBE), 13-3 
BSR. See Boundary-Scan Register (BSR) 
BSTO bit (Burst-Mode ROM, Bank 0), 8-1 

BST1 bit (Burst-Mode ROM, Bank 1), 8-2 
BST2 bit (Burst-Mode ROM, Bank 2), 8-2 
BSTS bit (Burst-Mode ROM, Bank 3), 8-2 

BSY bit (PBUSY Level), 13-4 

BURST signal, definition, 7-3 


burst-mode 
boot ROM state, 8-3 
DRAM page-mode accesses, 9-2, 9-6—9-7, 9-10 
multiple data accesses, 3-11 

- ROM accesses, 8-1, 8-2, 8-5, 8-6, 8-8, 8-10 


Burst-Mode Access signal. See BURST signal 


Index 


BWE bit (Byte Write Enable), 8-2 

BYPASS instruction, 17-8 

bypass scan path, 17-8 

Byte Pointer Register, description, 3-2-3-3 
byte writes, ROM space, 8-7-8-8 


C 


C bit (Carry) 
ALU Status Register, 2-16 
arithmetic operation status results, 2-17 
multiprecision integer operations, 2-25 


CALL (Call Subroutine) instruction, description, 18-26 


-CALLI (Call Subroutine, Indirect) instruction, 
description, 18-27 


calling conventions, 4-13—4-14 


CAS3-CASO signals 
definition, 7-4 
during static-column accesses, 9-2 
using as byte stobes, 8-2 


CAS-before-RAS refresh cycles 
DRAM refresh, 9-10 
REFRATE field (Refresh Rate), 9-2 
timing (diagram), 9-11 
CHA field (Channel Address), 16-18 | 
Channel Address Register 
description, 16-18 
multiple data accesses, 3-10 
Channel Control Register 
description, 16-18-16-19 
multiple data accesses, 3-10—3-11 


Channel Data Register, description, 16-18 
character data, format, 3-1—3-2 

character strings, overview, 3-4 

CHD field (Channel Data), 16-18 


CLASS (Classify Floating-Point Operand) instruction, 
_ description, 18-28—18-29 


~CLKDIV field (Clock Divide), 15-1 
CLKI bit (Clock Invert), 15-2 


clock signals 
~ INCLK, 7-1 
MEMCLK, 7-1 
TCK, 7-6 
UCLK, 7-6 
VCLK, 7-6 


CLZ (Count Leading Zeros) instruction, description, 
18-30 


CNTL field 
Boundary Scan Register, 17-3—-17-4 


AMD ct 
boundary-scan cells, 17-5 
Halt mode, 17-11-17-12 
ICTEST1 scan path, 17-10 
ICTEST2 scan path, 17-10—17-11 
Load Test Instruction mode, 17-13-17-14 
Step mode, 17-12—-17-13 


Column Address Strobes, Banks 3-0 signals. See 
CAS3—CASO signals 


compare instructions 

ASEQ (Assert Equal To), 18-16. 

ASGE (Assert Greater Than or Equal To), 18-17 

ASGEU (Assert Greater Than or Equal To, 
Unsigned), 18-18 

ASGT (Assert Greater Than), 18-19 

ASGTU (Assert Greater Than, Unsigned), 18-20 

ASLE (Assert Less Than or Equal To), 18-21 

ASLEU (Assert Less Than or Equal To, Unsigned), 
18-22 

ASLT (Assert Less Than), 18-23 | : 

ASLTU (Assert Less Than, Unsigned), 18-24 

ASNEQ (Assert Not Equal To), 18-25 

CPBYTE (Compare Bytes), 18-36 

CPEQ (Compare Equal To), 18-37 

CPGE (Compare Greater Than or Equal To), 18-38 

CPGEU (Compare Greater Than or Equal To, 
Unsigned), 18-39 

CPGT (Compare Greater Than), 18-40 , 

CPGTU (Compare Greater Than, Unsigned), 18-41 

CPLE (Compare Less Than or Equal To), 18-42 

CPLEU (Compare Less Than or Equal To, 
Unsigned), 18-43 

CPLT (Compare Less Than), 18-44 

CPLTU (Compare Less Than, Unsigned), 18-45 

CPNEQ (Compare Not Equal To), 18-46 

overview, 2-1—2-3 

table, 2-3 


compiler | 
delayed branches, 5-3—5-4 
High C 29K optimizing C compiler, 1-7 
run-time stack organization, 4-1—4-6 
temporary registers, 4-13 
transparent procedures, 4-13-4-15 


complementing a Boolean, 2-25 
Configuration Register, description, 2-26-—2-27 


CONST (Constant) instruction 
description, 18-31 
generation of large constants, 3-5 
large jump and call ranges, 2-25 


constant instructions 
CONST (Constant), 18-31 
CONSTH (Constant, High), 18-32 
CONSTN (Constant, Negative), 18-33 
overview, 2-5 
table, 2-5 


CONSTH (Constant, High) instruction 
description, 18-32 ; 
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generation of large constants, 3-5 
large jump and call ranges, 2-25 


CONSTN (Constant, Negative) instruction 
description, 18-33 | 
generation of large constants, 3-5 


CONVERT (Convert Data Format) instruction, 
description, 18-34—18-35 : 


CPBYTE (Compare Bytes) instruction 
character data, 3-2 
description, 18-36 
detection of characters within words, 3-4 


CPEQ (Compare Equal To) instruction, description, 
18-37 


CPGE (Compare Greater Than or Equal To) 
instruction 
complementing a Boolean, 2-25 
description, 18-38 


CPGEU (Compare Greater Than or Equal To, 
Unsigned) instruction, description, 18-39 


CPGT (Compare Greater Than) instruction, 
description, 18-40 


CPGTU (Compare Greater Than, Unsigned) 
instruction, description, 18-41 


CPLE (Compare Less Than or Equal To) instruction, 
description, 18-42 


CPLEU (Compare Less Than or Equal To, panes! 
instruction, description, 18-43 


CPLT (Compare Less Than) instruction, description, 


CPLTU (Compare Less Than, Unsigned) instruction, 
description, 18-45 


CPNEQ (Compare Not Equal To) instruction, 
description, 18-46 


CPU Status signals. See STAT2—STATO signals 


CR field (Load/Store Count Remaining) 
Channel Control Register, 16-19 
Load/Store Count Remaining Register, 3-11 
multiple access operations, 3-9-3-10 


CTE bit (Count Terminate Enable), 11-3, 11-7 
CTI bit (Count Terminate Interrupt), 11-4, 11-7 


Current Processor Status Register 
after an interrupt or trap, 16-11 
before interrupt return, 16-11 
control of tracing, 17-1-17-2 
delayed effects of registers, 5-5, 16-2 
description, 16-1-16-3 
Reset mode, 2-27 


CV bit (Contents Valid), 16-19 
multiple access operations, 3-10 
restarting faulting accesses, 16-17—16-18 
returning from interrupts or traps, 16-12 


D 


DA bit (Disable All Interrupts and Traps) 

Current Processor Status Register, 16-3 

disabling interrupts, 16-3 

exceptions during interrupt and trap handling, 16-21 
DACK1—DACKO signals, definition, 7-4—7-5 


DADD (Floating-Point Add, Double-Precision) 
instruction, description, 18-47 








data movement instructions 

EXBYTE (Extract Byte), 18-61 

EXHW (Extract Half-Word), 18-62 

EXHWS (Extract Half-Word, Sign-Extended), 18-63 

INBYTE (Insert Byte), 18-74 

INHW (Insert Half-Word), 18-75 

LOAD (Load), 18-86 

LOADL (Load and Lock), 18-87 

LOADM (Load Multiple), 18-88 

LOADSET (Load and Set), 18-89 

MFSR (Move from Special Register), 18-90 

MFTLB (Move from Translation Look-Aside Buffer 
Register), 18-91 

movement of large data blocks, 3-11 

MTSR (Move to Special Register), 18-92 — 

MTSRIM (Move to Special Register Immediate), 
18-93 | 

MTTLB (Move to Translation Look-Aside Buffer 
Register), 18-94 

overview, 2-4—2-6 

STORE (Store), 18-110 

STOREL (Store and Lock), 18-111 

STOREM (Store Multiple), 18-112 

table, 2-5 


Data Set Ready signal. See DSR signal 
Data Terminal Ready signal. See DTR signal 


data types 

floating-point data types, 3-5—3-7 
denormalized numbers, 3-7 
double-precision floating-point values, 3-6 
infinity, 3-7 
Not-a-Number, 3-6—3-7 
single-precision floating-point values, 3-5 
special floating-point values, 3-6-3-7 
zero, 3-7 

integer data types, 3-1—3- 5 
bit strings, 3-3-3-4 
Boolean data, 3-5 
character data, 3-1—3-2 
character string operations, 3-4 
half-word operations, 3-2 
instruction constants, 3-5 


DDIR bit (Data Direction) 
Parallel Port Control Register, 13-2 
Video Control Register, 15-2 
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Index 


DDIV (Floating-Point Divide, Double-Precision) 
instruction, description, 18-48 


debugging and testing 
accessing internal state via boundary-scan, 
17-14—17-16 | 
boundary-scan cells, 17-4—17-5 
control field in scan path, 17-3—17-4 
emulating the Am29205 microcontroller, 17-17 


forcing outputs to high impedance, 17-17-17-18 __ 


Halt mode, 17-11-17-12 

implementing a hardware-development system, 
17-11-17-17 

instruction breakpoints, 17-2 | 

Load Test Instruction mode, 17-13-17-14 

overview, 17-1 | 

processor status outputs, 17-2-17-3 

Step mode, 17-12-17-13 

Test Access Port, 17-4-17-11 

tracing, 17-1-17-2 


delayed branches, 5-2-5-4 
delayed effects of registers, 5-5 


DEQ (Floating-Point Equal To, Double-Precision) 
instruction, description, 18-49 


development tools 
AMD products, xx, 1-7 
compiler, xx, 1-7 
debugger, xx, 1-7 
development boards, xx, 1-7 
monitor, xx, 1-7 
third-party products, iii, xix—-xx, 1-7 
DF bit (Divide Flag), 2-16 _ 
DGE (Floating-Point Greater Than or Equal To, 
| Double-Precision) instruction, description, 18-50 
DGT (Floating-Point Greater Than, Double-Precision) 
instruction, description, 18-51 
DHH bit (Disable Hardware Handshake), 13-2-13-3 
DI bit (Disable Interrupts), 16-3 
disabling interrupts, 16-3 
DIV (Divide Step) instruction, description, 18-52 
DIVO (Divide Initialize) instruction, description, 18-53 
DIVIDE (Integer Divide, Signed) instruction, 
description, 18-54 


DIVIDU (Integer Divide, Unsigned) instruction, 
description, 18-55 


division, routines for performing, 2-19-2-20, 
2-22-2-24 


division instructions 
DDIV (Floating-Point Divide, Double-Precision), 
18-48 
DIV (Divide Step), 18-52 
DIVO (Divide Initialize), 18-53 
DIVIDE (Integer Divide, Signed), 18-54 
DIVIDU (Integer Divide, Unsigned), 18-55 


Index 
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DIVL (Divide Last Step), 18-56 

DIVREM (Divide Remainder), 18-57 

FDIV (Floating-Point Divide, Single-Precision), 
18-66 : 


DIVL (Divide Last Step) instruction, description, 18-56 


DIVREM (Divide Remainder) instruction, description, 
18-57 


DM bit (Floating-Point Divide-By-Zero Mask), 2-15 


DMA Acknowledge 1 through 0 signals. See 
DACK1—DACKO signals 


DMA controller 
DMA queuing (DMA Channel 0), 11-12 
DMA transfers, 11-8-11-12 
direct transfers, 11-1, 11-12-11-15 
external peripherals, 11-1 
external transfers, 11-1, 11-9-—11-11 
generating external DMA requests, 11-9 
internal peripherals, 11-1 
internal transfers, 11-1, 11-8—11-9 
latching external requests, 11-11-—11-12 
parallel port, 13-2 
serial port, 14-2, 14-3 
specifying direction, 11-8 © 
video interface, 15-2 
initialization, 11-7 
overview, 11-1 
programmable registers, 11-1—11-7 
signals 
DACK1—DACKO, 7-4—7-5 
DREQ1—DREQO, 7-4 
GACK, 7-5 
GREQ, 7-5 
TDMA, 7-5 


DMA Request 1 through 0 signals. See 
DREQi-—DREQO signals 


DMAO Address Register, description, 11-4 


DMAO Address Tail Register 
address assignments, 7-8-7-10 
description, 11-4 


DMAO Control Register, description, 11-1-11-4 
DMAO Count Register, description, 11-5 


DMAO Count Tail Register 
address assignments, 7-8—7-10 
description, 11-5 


DMAOI bit (DMA Channel 0 Interrupt), 16-24 
DMA1 Address Register, description, 11-7 
DMAI1 Control Register, description, 11-5—11-7 
DMA1 Count Register, description, 11-7 
DMA(1I bit (DMA Channel 1 Interrupt), 16-24 
DMACNT field (DMA Count), 11-5 

DMAEXT bit (DMA Extend), 11-2, 11-6 
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DMAWAIT field (DMA Wait States), 11-2, 11-6 


DMUL (Floating-Point Multiply, Double-Precision) 
instruction, description, 18-58 


DO bit (Integer Division Overflow Mask), 2-15 


DRAM accesses 

16-bit DRAM, 9-7 

32-bit DRAM, 9-7 

address multiplexing, 9-5—-9-7 
DRAM address mapping, 9-4—9-5 
DRAM refresh, 7-7, 9-10—9-12 

. Mapped accesses, 9-8 

normal access timing, 9-8 
page-mode access timing, 9-10 
restarting mapped DRAM accesses, 16-17—16-19 
video DRAM interface, 9-12 


DRAM Configuration Register, description, 9-2-9-3 
DRAM Control Register, description, 9-1-9-2 


DRAM controller 
See also DRAM accesses 
address mapping, 8-4, 9-4-9-5 
initialization, 9-4 
overview, 9-1 
‘programmable registers 
DRAM Configuration Register, 9-2—-9-3 
DRAM Control Register, 9-1—9-2 
signals | 
CAS3-CASO, 7-4 
RAS3-RASO, 7-3 
TR/OE, 7-4 
WE, 7-4 | 
DRAM Mapping Register 0, description, 9-3-9-4 
DRAM Mapping Register 1, description, 9-4 
DRAM Mapping Register 2, description, 9-4 
DRAM Mapping Register 3, description, 9-4 
DRAM refresh, panic mode, 7-7 
DRAMADDR field (DRAM Address), 11-4 


DREQ1—DREQ0 signals 

definition, 7-4 

signaling external DMA requests, 11-2, 11-6 
DRM field (DMA Request Mode), 11-2, 11-6 


DRQ bit (Data Request) 
Parallel Port Control Register, 13-2 
Video Control Register, 15-1 
DS bit (Floating-Point Divide-By-Zero Sticky), 2-19 
DSR bit (Data Set Ready), Serial Port Control 
Register, 14-1 


DSR signal 
activating, 14-1 
definition, 7-6 


DSUB (Floating-Point Subtract, Double-Precision) 
instruction, description, 18-59 
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DT bit (Floating-Point Divide-By-Zero Trap), 
2-18-2-19 
DTR bit (Data Terminal Ready), Serial Port Status 
Register, 14-4 
DTR signal 
activating, 14-4 
definition, 7-6 


DW field (Data Width), 11-2, 11-6 

DWO bit (Data Width, DRAM Bank 0), 9-2 
DWO field (Data Width, ROM Bank 0), 8-2 
DW1 bit (Data Width, DRAM Bank 1), 9-2 
DW1 field (Data Width, ROM Bank 1), 8-2 
DW2 bit (Data Width, DRAM Bank 2), 9-2 
DW2 field (Data Width, ROM Bank 2), 8-2 
DWS3 bit (Data Width, DRAM Bank 3), 9-2 
DWS3 field (Data Width, ROM Bank 3), 8-2 
dynamic parent, 4-12—4-13 


E 


EMULATE (Trap to Software Emulation Routine) 
instruction | 
description, 18-60 
operating-system calls, 2-24—2-25 


EN bit (Enable), 11-3, 11-7 
DMA initialization, 11-7 


endian. See big endian 


EXBYTE (Extract Byte) instruction 
BP field (Byte Pointer), 2-16—2-17 
Byte Pointer Register, 3-2 
character data, 3-1 
description, 18-61 


EXHW (Extract Half-Word) instruction 
BP field (Byte Pointer), 2-17 
Byte Pointer Register, 3-2 
description, 18-62 
half-word operations, 3-2 


EXHWS (Extract Half-Word, Sign-Extended) 
instruction 
BP field (Byte Pointer), 2-17 
Byte Pointer Register, 3-2 
description, 18-63 
half-word operations, 3-2 


External Memory Grant Acknowledge signal. See 
GACK signal 


External Memory Grant Request signal. See GREQ 
signal 


EXTEST instruction, 17-6 


EXTRACT (Extract Word, Bit-Aligned) instruction 
bit strings, 3-3 
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description, 18-64 
FC field (Funnel Shift Count), 2-17 
operating on double-word data, 2-4 


F 


FACK bit (Force ACK), 13-2 


FADD (Floating-Point Add, Single-Precision) i instruc- 
tion, description, 18-65 


FBUSY bit (Force Busy), 13-2 


FC field (Funnel Shift Count), 3-3—3-4 
ALU Status Register, 2-17 


FDIV (Floating-Point Divide, Single-Precision) instruc- 
tion, description, 18-66 


‘FDMUL (Floating-Point Multiply, Single-to-Double Pre- 
cision) instruction, description, 18-67 


FEQ (Floating-Point Equal To, Single-Precision) 
instruction, description, 18-68 


FER bit (Framing Error), 14-4 
FF bit (Fast Float Select), 2-14 


FGE (Floating-Point Greater Than or Equal To, 
Single-Precision) instruction, description, 18-69 


FGT (Floating-Point Greater Than, Single-Precision) 
instruction, description, 18-70 


field summary 
peripheral registers (table), C-6-C-11 
processor registers vevle). B-7—B-9 


fields. See bits 
fill handlers, 4-11 


floating-point data types 
denormalized numbers, 3-7 
double-precision floating- poms values, 3-6 
infinity, 3-7 
Not-a-Number, 3-6—3-7 
single-precision floating-point values, 3-5 
special peal rent values, 3-6—3-7 
zero, 3-7 | 


Floating-Point Environment Register 
description, 2-14—-2-15 
not implemented in processor hardware, 2-11° 


Floating-Point Exception trap 
Floating-Point Environment Register, 2-15 
Floating-Point Status Register, 2-18-2-19 
trap status bits, 2-18-2-19 


floating-point instructions 
CLASS (Classify Floating-Point Operand), 
18-28—18-29 
CONVERT (Convert Data Format), 18-34—18-35 
DADD (Floating-Point Add, cae aca 
18-47 
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DDIV (Floating-Point Divide, Double-Precision), 
18-48 

DEQ (Floating-Point Equal To, Double-Precision), 
18-49 | 

DGE (Floating-Point Greater Than or Equal To, 
Double-Precision), 18-50 

DGT (Floating-Point Greater Than, Double- 
Precision), 18-51 

DMUL (Floating-Point Multiply, Double-Precision), 
18-58 

DSUB (Floating-Point Subtract, Double-Precision), 

7 18-59 


FADD (Floating-Point Add, Single-Precision), 18-65 

FDIV (Floating-Point Divide, Single-Precision), 
18-66 

FDMUL (Floating-Point Multiply, Single-to-Double 

Precision), 18-67 

FEQ (Floating-Point Equal To, Single-Precision), 
18-68 

FGE (Floating-Point Greater Than or Equal To, 
Single-Precision), 18-69 

FGT (Floating-Point Greater Than, Single- 
Precision), 18-70 

FMUL (Floating-Point Multiply, Single- Precision), 
18-71 

FSUB (Floating-Point Subtract, Single-Precision), 
18-72 

overview, 2-6 

SQRT (Floating-Point Square Root), 18-107 

status results, 2-18 

table, 2-6 

Floating-Point Status Register 

description, 2-18-2-19 

not implemented in processor hardware, 2-11 

Protection Violation trap, 2-26 . 

sticky status bits, 2-18-2-19 

trap status bits, 2-18-2-19 

FMUL (Floating-Point Multiply, Single-Precision) 
instruction, description, 18-71 


fp. See frame pointer (fp) 
frame, definition, 4-7 


frame pointer (fp) 
definition, 4-5 
register conventions, 4-14 


Freeze bit. See FZ bit (Freeze) 
FRM field (Floating-Point Round Mode), 2-14 


FSUB (Floating-Point Subtract, Single-Precision) 
instruction, description, 18-72 


Funnel Shift Count Register, description, 3-3-3-4 


FWT bit (Full Word Transfer), 13-1 
required setting for Am29205 microcontroller, A-1 | 


FZ bit (Freeze) 
Current Processor Status Register, 16-2 
delayed effects of registers, 5-5, 16-2 
Halt mode, 17-12 
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interrupt and trap handling, 16-6-16-13 
lightweight interrupt processing, 16-12-16-13 
Program Counter registers, 16-6—16-9 
registers affected by, 16-10, 16-11-16-12 
restarting the interrupt or trap handler, 16-21 
Step mode, 17-12 


G 


GACK signal 
definition, 7-5 
direct DMA access by external devices, 
11-12-11-15 


general-purpose registers 
addressing terminology, 2-10 
operands held by, 2-8—2-10 
organization, 2-9, B-1 
overview, 2-8-2-10 


global registers 
global-register number, 2-10 
overview, 2-10 


GREQ signal 
definition, 7-5 
direct DMA access by external devices, 
11-12-11-15 


H 


half-word data, format, 3-2 


HALT (Enter Halt Mode) instruction 
description, 18-73 | 
instruction breakpoints, 17-2, 17-16—17-17 


Halt mode, 17-11-17-12 


hardware-development system. See debugging and 
testing 


host interface (HIF) specification. See operating 
system 


| 
I/O port. See Programmable I/O Port (PIO) 
ICTEST1 instruction, 17-7 
ICTEST1 scan path, 17-10 
ICTEST2 instruction, 17-7 
ICTEST2 scan path, 17-10—17-11 
1D31-IDO signals, definition, 7-1 
IE bit (Interrupt Enable), 16-23 


IEEE floating-point standard | 
implementation, 3-5-3-7 
improving performance of, 2-14 
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lilegal Opcode trap 
instruction breakpoints, 17-2 
unimplemented instructions, 2-1 
unpredictable vector number, 16-4 


IM field (Interrupt Mask), 16-3 
enabling interrupts, 16-3 


IN bit (Interrupt), 16-23 


INBYTE (Insert Byte) instruction 
BP field (Byte Pointer), 2-16—2-17 
Byte Pointer Register, 3-2 
character data, 3-2 
description, 18-74 


INCLK signal, definition, 7-1 

Indirect Pointer A Register, description, 2-13 
Indirect Pointer B Register, description, 2-14 
Indirect Pointer C Register, description, 2-13 


indirect pointers | 
delayed effects of registers, 5-5 
set by certain instructions, 2-13 


INHW (Insert Half-Word) instruction 
BP field (Byte Pointer), 2-17 
Byte Pointer Register, 3-2 
description, 18-75 
half-word operations, 3-2 
initialization 
See also processor initialization 
DMA controller, 11-7 
DRAM controller, 9-4 
internal interrupt controller, 16-23—16-25 
parallel port, 13-4—13-5 
Peripheral Interface Adapter (PIA), 10-2 
programmable I/O port, 12-3 
ROM controller, 8-3-8-4 
serial port, 14-5 
timer facility, 16-21 
video interface, 15-4 


Input Clock signal. See INCLK 

Instruction Bus signals. See ID31-IDO signals 
instruction constants, 3-5 

instruction scan path, 17-8 

instruction scheduling. See pipelining 


instruction set 
ADD (Add), 18-8 
ADDC (Add with Carry), 18-9 

. ADDCS (Add with Carry, Signed), 18-10 
ADDCU (Add with Carry, Unsigned), 18-11 
ADDS (Add, Signed), 18-12 
ADDU (Add, Unsigned), 18-13 
AND (AND logical), 18-14 
ANDN (AND-NOT logical), 18-15 
arithmetic operation status results, 2-17 
ASEQ (Assert Equal To), 18-16 


Index 


ASGE (Assert Greater Than or Equal To), 18-17 
ASGEU (Assert Greater Than or Equal To, 
Unsigned), 18-18 
ASGT (Assert Greater Than), 18-19 
ASGTU (Assert Greater Than, Unsigned), 18-20 
ASLE (Assert Less Than or Equal To), 18-21 
ASLEU (Assert Less Than or Equal To, Unsigned), 
18-22 — 
ASLT (Assert Less Than), 18-23 
ASLTU (Assert Less Than, Unsigned), 18-24 
ASNEQ (Assert Not Equal To), 18-25 
assembler syntax, 18-3—18-64 
branch instructions, 2-7 
CALL (Call Subroutine), 18-26 
CALLI (Call Subroutine, Indirect), 18-27 
CLASS (Classify Floating-Point Epeenes 
18-28-18-29 
CLZ (Count Leading Zeros), 18-30 
compare instructions, 2-1—2-3 
CONST (Constant), 18-31 
constant instructions, 2-5 
CONSTH (Constant, High), 18-32 
CONSTN (Constant, Negative), 18-33 
control-flow terminology, 18-3 
CONVERT (Convert Data Format), 18-34—-18-35 
CPBYTE (Compare Bytes), 18-36 
CPEQ (Compare Equal To), 18-37 
CPGE (Compare Greater Than or Equal To), 18-38 
CPGEU (Compare Greater Than or Equal To, 
Unsigned), 18-39 
~ CPGT (Compare Greater Than), 18-40 
CPGTU (Compare Greater Than, Unsigned), 18-41 
CPLE (Compare Less Than or Equal To), 18-42 
CPLEU (Compare Less Than or Equal To, 
Unsigned), 18-43 
CPLT (Compare Less Than), 18-44 
-CPLTU (Compare Less Than, Unsigned), 18-45 
CPNEQ (Compare Not Equal To), 18-46 
DADD (Floating-Point Add, Double-Precision), 
18-47 
data movement instructions, 2-4—2-6 
DDIV (Floating-Point Divide, Double-Precision), 
18-48 
DEQ (Floating-Point Equal To, Double-Precision), 
18-49 
description format, 18-7 
descriptions, 18-8—18-126 
DGE (Floating-Point Greater Than or Equal To, 
Double-Precision), 18-50 
DGT (Floating-Point Greater Than, Double- 
Precision), 18-51 
DIV (Divide Step), 18-52 
DIVO (Divide Initialize), 18-53 
DIVIDE (Integer Divide, Signed), 18-54 
DIVIDU (Integer Divide, Unsigned), 18-55 
DIVL (Divide Last Step), 18-56 
DIVREM (Divide Remainder), 18-57 
DMUL (Floating-Point Multiply, Double-Precision), 
18-58 
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DSUB (Floating-Point Subtract, Double-Precision), 
18-59 

EMULATE (Trap to Software Emulation Routine), 
18-60 

EXBYTE (Extract Byte), 18-61 

EXHW (Extract Half-Word), 18-62 | 

EXHWS (Extract Half-Word, Sign-Extended), 18-63 

EXTRACT (Extract Word, Bit-Aligned), 18-64 — 

FADD (Floating-Point Add, Single-Precision), 18-65 

FDIV (Floating-Point Divide, Single-Precision), 
18-66 

FDMUL (Floating-Point Multiply, Single-to-Double 
Precision), 18-67 

FEQ (Floating-Point Equal To, Single-Precision), 
18-68 

FGE (Floating-Point Greater Than or Equal To, 
Single-Precision), 18-69 

FGT (Floating-Point Greater Than, Single- 
Precision), 18-70 

floating-point instructions, 2-6 

floating-point operation status results, 2-18 

FMUL (Floating-Point Multiply, Single-Precision), 
18-71 

FSUB (Floating-Point Subtract, Single- Precision), 
18-72 

HALT (Enter Halt Mode), 18-73 

INBYTE (Insert Byte), 18-74 

INHW (insert Half-Word), 18-75 

instruction formats, 18-4—18-5 

integer arithmetic instructions, 2-1-2-3 _ 

INV (Invalidate), 18-76 

IRET (Interrupt Return), 18-77 

IRETINV (Interrupt Return and Invalidate), 18-78 

JMP (Jump), 18-79 

JMPF (Jump False), 18-80 

JMPFDEC (Jump False and Decrement), 18-81 

JMPFI (Jump False Indirect), 18-82 

JMPI (Jump Indirect), 18-83 

JMPT (Jump True), 18-84 

JMPTI (Jump True Indirect), 18-85 | 

LOAD (Load), 18-86 

load and store instructions, 3-7—3-9 

LOADL (Load and Lock), 18-87 

LOADM (Load Multiple), 18-88 

LOADSET (Load and Set), 18-89 

logical instructions, 2-4 

logical operation status results, 2-17—2-18 

MFSR (Move from Special Register), 18-90 

MFTLB (Move from Translation Look-Aside Butter 
Register), 18-91 

miscellaneous instructions, 2-7—2-9 

MTSR (Move to Special Register), 18-92 

MTSRIM (Move to Special Register Immediate), 
18-93 

MTTLB (Move to Translation Look-Aside Buffer 
Register), 18-94 

MUL (Multiply Step), 18-95 

MULL (Multiply Last Step), 18-96 

MULTIPLU (Integer Multiply, Unsigned), 18-97 

MULTIPLY (Integer Multiply, Signed), 18-98 
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MULTM (Integer Multiply Most Significant Blts, 
Signed), 18-99 | 

MULTMU (Integer Multiply Most Significant Bits, 
Unsigned), 18-100 

MULU (Multiply Step, Unsigned), 18-101 

NAND (NAND Logical), 18-102 

NOR (NOR Logical), 18-103 

operand notation and symbols, 18-1—18-2 

operation code index, 18-127—18-129 

’ operator symbols, 18-2—18-3 

OR (OR Logical), 18-104 

overview, 2-1—2-8 

reserved instructions, 2-8 

SETIP (Set Indirect Pointers), 18-105 

shift instructions, 2-4 

SLL (Shift Left Logical), 18-106 

SQRT (Floating-Point Square Root), 18-107 

SRA (Shift Right Arithmetic), 18-108 

SRL (Shift Right Logical), 18-109 

STORE (Store), 18-110 

STOREL (Store and Lock), 18-111 

STOREM (Store Multiple), 18-112 

SUB (Subtract), 18-113 

SUBC (Subtract with Carry), 18-114 

SUBCS (Subtract with Carry, Signed), 18-115 

SUBCU (Subtract with Carry, Unsigned), 18-116 

SUBR (Subtract Reverse), 18-117 

SUBRC (Subtract Reverse with Carry), 18-118 

SUBRCS (Subtract Reverse with Carry, Signed), 
18-119 

SUBRCU (Subtract Reverse with Carry, Unsigned), 
18-120 

SUBRS (Subtract Reverse, Signed), 18-121 

SUBRU (Subtract Reverse, Unsigned), 18-122 

SUBS (Subtract, Signed), 18-123 

SUBU (Subtract, Unsigned), 18-124 

- terminology, 18-1—18-4 
XNOR (Exclusive-NOR Logical), 18-125 
XOR (Exclusive-OR Logical), 18-126 


integer arithmetic instructions. See arithmetic 
instructions 
integer data types, 3-1—3-5 
Integer Environment Register, description, 2-15—-2-16 
internal peripherals 
address assignments (table), 7-10 


alternate register addresses, 7-8—7-9 
DMA transfers, 11-1 


Interrupt Control Register, description, 16-23—16-24 


Interrupt Requests 3-0 signals. See INTR3-INTRO 
signals 


interrupts, enabling and disabling, 16-3 
interrupts and traps 
Current Processor Status Register, description, 
16-1-16-3 
exception reporting and restarting, 16-16—-16-21 
Channel Address Register, 16-18 
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Channel Control Register, 16-18—16-19 
Channel Data Register, 16-18 
correcting out-of-range results, 16-20 
exceptions during interrupt and trap handling, 
16-21 
floating-point exceptions, 16-20 
instruction exceptions, 16-16 
integer exceptions, 16-19-16-20 
restarting faulting accesses, 16-17-16-19 
external interrupts and traps, 16-4 
interrupt controller 
initialization, 16-25 
Interrupt Control Register, 16-25 
overview, 16-23 
servicing internal interrupts, 16-25 
interrupts, 16-3 | 
latency, 16-4 
lightweight interrupt processing, 16-12—-16-13 
Old Processor Status Register, description, 16-6 
overview, 16-1 
_ priority (table), 16-15 
Program Counter stack, 16-6~16-10 
Program Counter 0 Register, 16-9 
Program Counter 1 Register, 16-9 
Program Counter 2 Register, 16-10 
teturning from an interrupt or trap, 16-11-16-12 
sequencing, 16-14—16-16 
simulation of interrupts and traps, 16-13—-16-14 
taking an interrupt or trap, 16-10 
Timer Facility | 
handling timer interrupts, 16-22 
initialization, 16-21 
overview, 16-21 
Timer Counter Register, 16-22—16-23 
Timer Reload Register, 16-23 
uses, 16-22 
traps, 16-4 
vector area, 16-5—16-6 
Vector Area Base Address Register, description, 
16-5 | 
vector numbers 
assignments (table), 16-7-16-9 
definition, 16-5-16-6 
Wait mode, 16-4—16-5 
WARN input, 16-14—16-16 
WARN trap, 16-13-16-14 


INTEST instruction, 17-6—17-7 


INTR3—INTRO signals 
definition, 7-2 
interrupts, 16-3, 16-4 
INTR3, internal interrupt controller, 16-23—16-25 


INV (invalidate) instruction, description, 18-76 
INVERT field (PIO Inversion), 12-2 

IOEXTO bit (Input/Output Extend, Region 0), 10-2 
IOEXT1 bit (Input/Output Extend, Region 1), 10-2 
IOEXT2 bit (Input/Output Extend, Region 2), 10-2 





Index 


lIOEXTS bit (Input/Output Extend, Region 3), 10-2 
1OEXT4 bit (Input/Output Extend, Region 4), 10-2 
lOEXTS bit (Input/Output Extend, Region 5), 10-2 
IOPI field (I/O Port Interrupt), 16-24 


lOWAITO field (Input/Output Wait States, Region 0), 
10-2 

lOWAIT1 field (Input/Output Wait States, Region 1), 
10-2 

lOWAIT2 field (Input/Output Wait States, Region 2), 


IOWAITS field (Input/Output Wait States, Region 3), 
10-2 


IOWAIT4 field (Input/Output Wait States, Region 4), 


IOWAITS field (Input/Output Wait States, Region 5), 
10-2 


IP bit (Interrupt Pending), 16-2 

IPA field (Indirect Pointer A), 2-13 
IPB field (Indirect Pointer B), 2-14 
IPC field (Indirect Pointer C), 2-13 


IRET (Interrupt Return) instruction 
description, 18-77 
restarting mapped DRAM accesses, 16- 17-16- 18 
returning from interrupts and traps, 16-11-16-12 


IRETINV (Interrupt Return and ee) instruction 
description, 18-78 
restarting mapped DRAM accesses, 16-17—16-18 
returning from interrupts and traps, 16-11-16-12 


IRM14-IRM8 fields, 12-2 


IRM15 field (Interrupt Request Mode, PIO15), 
12-1-12-2 


JMP (Jump) instruction, description, 18-79 
JMPF (Jump False) instruction, description, 18-80 


JMPFDEC (Jump False and Decrement) instruction, 
description, 18-81 


JMPFIi (Jump False Indirect) instruction, description, 
18-82 | 

JMPI (Jump Indirect) instruction, description, 18-83 

JMPT (Jump True) instruction, description, 18-84 

sae e True Indirect) instruction, description, 


JTAG 1149.1 boundary-scan interface 
See also Test Access Port 
IEEE standard document, xx . 
signals 
TCK, 7-6 
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TDI, 7-6 
TDO, 7-7 
TMS, 7-6 
TRST, 7-7 


jump instructions 
JMP (Jump), 18-79 
JMPF (Jump False), 18-80 
JMPFDEC (Jump False and Decrement), 18-81 
JMPFI (Jump False Indirect), 18-82 
JMPI (Jump Indirect), 18-83 
JMPT (Jump True), 18-84 
JMPT! (Jump True Indirect), 18-85 


jumps 
delayed branches, 5-2—-5-4 
large jump and call ranges, 2-25 


L 


large return pointer (Irp) 
description, 4-10 
register conventions, 4-13 


leaf procedures : 
calling other procedures, 4-8 
register stack leaf frames, 4-11 


LEFTCNT field (Left Margin Count), 15-3 
Line Synchronization signal. See LSYNC signal 
LINECNT field (Line Count), 15-3 
LM bit (Large Memory), 8-2, 9-2 
LOAD (Load) instruction, description, 18-86 
load and store instructions 
BP field (Byte Pointer), 2-17 
format, 3-7—-3-9 
OPT field (Option), 3-8 
RA, 3-8 
RB or I, 3-8 
SB bit (Set Byte Pointer/Sign Bit), 3-8 
load operations, 3-9 
multiple accesses, 3-9—3-11 
overlapped loads and stores, 5-4-5-5 
store operations, 3-9 


Load Test Instruction mode, 17-13-17-14 
Load/Store Count Remaining Register, description, 


LOADL (Load and Lock) instruction, description, 
18-87 


LOADM (Load Multiple) instruction 
description, 18-88 
multiple data accesses, 3-9-3-11 


LOADSET (Load and Set) instruction, description, 
18-89 


local registers 
local-register number, 2-10 
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overview, 2-10-2-11. 


logical instructions 
AND (AND logical), 18-14 
_ ANDN (AND-NOT logical), 18-15 
NAND (NAND Logical), 18-102 
NOR (NOR Logical), 18-103 . 
OR (OR Logical), 18-104 
overview, 2-4 
SLL (Shift Left Logical), 18-106 
SRL (Shift Right Logical), 18-109 
status results, 2-17—2-18 
table, 2-4 
XNOR (Exclusive-NOR Logical), 18-125 
XOR (Exclusive-OR Logical), 18-126 


LOOP bit (Loopback), 14-1 

Irp.. See large return pointer (irp) 
LS bit (Load/Store), 16-19 

LSI bit (Line Sync Invert), 15-2 
LSYNC signal, definition, 7-6 


main data scan path, 17-8-17-10 

MEMCLK signal, definition, 7-1 

Memory Clock signal. See MEMCLK signal 
memory frame pointer (mfp), description, 4-12 
memory map, 7-8-7-10 


Memory Stack 
description, 4-5—4-6 
local variables and memory-stack frames, 
4-11-4-12 
prologue and epilogue routines for allocation, 4-12 
storage allocation, 4-2 


memory stack pointer (msp) 
description, 4-6, 4-12 
register conventions, 4-14 


memory-stack frame, 4-11-4-12 


MFSR (Move from Special Register) instruction 
accessing special-purpose registers, 2-8 
description, 18-90 


_ MFTLB (Move from Translation Look-Aside Buffer 
Register) instruction, description, 18-91 


miscellaneous instructions 
CLZ (Count Leading Zeros), 18-30 
EMULATE (Trap to Software Emulation Routine), 
18-60 
HALT (Enter Halt Mode), 18-73 
INV (Invalidate), 18-76 
IRET (interrupt Return), 18-77 
IRETINV (Interrupt Return and Invalidate), 18-78 
overview, 2-7—2-9 
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SETIP (Set Indirect Pointers), 18-105 
table, 2-8 


ML bit (Multiple Operation) 
Channel Control Register, 16-19 
multiple data accesses, 3-10 
returning from interrupts or traps, 16-12 


MO bit het ie Multiplication Overflow Pace 
Mask), 2-16 


MODE field (Parallel Port Mode), Parallel Port Control 
Register, 13-2 


MODE field (Video Interface Mode), Video Control 
Register, 15-2 


msize value, 4-15 
msp. See memory stack pointer (msp) 


MTSR (Move to Special Register) instruction 
accessing special-purpose registers, 2-8 
BP field (Byte Pointer), 2-17 — 
description, 18-92 
FC field (Funnel Shift Count), 2-17 


MTSRIM (Move to Special Register nee 
instruction 
accessing special-purpose registers, 2-8 
description, 18-93 


MTTLB (Move to Translation Look-Aside Buffer Regis- 
ter) instruction, description, 18-94 


MUL (Multiply Step) instruction, description, 18-95 


MULL (Multiply Last Step) instruction, description, 
18-96 


multiple data accesses 
description, 3-9-3-11 
Load/Store Count Remaining Register, 3-11 
movement of large data blocks, 3-11 


multiplication, routines for performing, 2-19-2-20 


multiplication instructions 
DMUL (Floating-Point Multiply, Double-Precision), 
18-58 
FDMUL (Floating-Point Multiply, Single-to-Double 
Precision), 18-67 
FMUL (Floating-Point Multiply, Single-Precision), 
18-71 


~ MUL (Multiply Step), 18-95 


MULL (Multiply Last Step), 18-96 

MULTIPLU (Integer Multiply, Unsigned), 18-97 

MULTIPLY (Integer Multiply, Signed), 18-98 

MULTM (Integer Multiply Most Significant Bits, 
Signed), 18-99 

MULTMU (Integer Multiply Most Significant Blts, 
Unsigned), 18-100 

MULU (Multiply Step, Unsigned), 18-101 


MULTIPLU (Integer Multiply, vaeaned) instruction, 
description, 18-97 


MULTIPLY (Integer Multiply, Signed) instruction, 
description, 18-98 


MULTM (Integer Multiply Most Significant Bits, 
Signed) instruction, description, 18-99 — 


MULTMU (Integer Multiply Most Significant Bits, 
Unsigned) instruction, description, 18-100 


MULU (Multiply Step, Unsigned) instruction, 
description, 18-101 


N 


N bit (Negative) 
ALU Status Register, 2-16 
arithmetic operation status results, 2-17 
logical operation status results, 2-17-2-18 
NAND (NAND Logical) instruction, description, 18-102 
NM bit (Floating-Point Invalid Operation Mask), 2-15 
NN bit (Not Needed) | 
Channel Control Register, 16-19 
restarting faulting accesses, 16-17—16-18 
returning from interrupts or traps, 16-12 
NO-OPs, 2-25-2-26 
NOR (NOR Logical) instruction, description, 18-103 
Not-a-Number 
_ definition, 3-6—3-7 
Quiet NaNs (QNaNs), 3-6-3-7 
Signaling NaNs (SNaNs), 3-6—-3-7 . 
NS bit (Floating-Point Invalid Operation Sticky), 2-19 
NT bit (Floating-Point Invalid Operation Trap), 2-19 


OER bit (Overrun Error), 14-4 


Old Processor Status Register 
control of tracing, 17-1—17-2 
description, 16-6 — 
operating system services, host interface (HIF) 
specification, xx, 1-7, 16-8 
operating-system calls, 2-24—2-25 
OPT field (Option) 
alignment of words and half-words, 3-13 
byte and half-word accesses, 3-12-3-13 
load and store instruction format, 3-8 
OR (OR Logical) instruction, description, 18-104 
Out-of-Range trap 
correcting out-of-range results, 16-20 
Integer Environment Register, 2-15—2-16 
integer exceptions, 16-19-16-20 
OV bit (Overflow), 16-23 


overflow. See spill handler 
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PACK signal 
assertion duration, 13-2 
definition, 7-5 
disabling hardware handshakes, 13-2-13-3 
forcing active level, 13-2 
signal level, 13-4 


Page Synchronization signal. See PSYNC signal 


page-mode 
DRAM accesses, 9-10 
DRAM page-mode read cycle (diagram), 9. 10 
DRAM page-mode write cycle (diagram), 9-11 
sequential DRAM accesses, 9-6-9-7 , 
specifying, 9-2 
static-column accesses, 9-2 


Parallel Data Register (PDR), 17-4—17-5 


parallel port | , 
initialization, 13-4—13-5 
internal DMA transfers, 11-1, 11-8—-11-12, 13-2 
overview, 13-1 
programmable registers 
Parallel Port Control Register, 13-1—13-3 
Parallel Port Data Register, 13-4 | 
Parallel Port Status Register, 13-3—-13-4 
signals 
PACK, 7-5 
PAUTOFD, 7-5 
‘ PBUSY, 7-5: 
POE, 7-5 
PSTROBE, 7-5 
PWE, 7-5 
transfers from the host, 13-5 
transfers to the host, 13-5—13-7 


Parallel Port Acknowledge signal. See PACK signal 
Parallel Port Autofeed signal. See PAUTOFD signal 
Parallel Port Busy signal. See PBUSY signal 

Parallel Port Control Register, description, 13-1-13-3 


Parallel Port Data Register, description, 13-4 


Parallel Port Output Enable signal. See POE sional 


Parallel Port Status Register 
address assignments, 7-9-7-11 
description, 13-3—13-4 


Parallel Port Strobe signal. See PSTROBE signal 
Parallel Port Write Enable signal. See PWE signal 


PAUTOFD signal 
definition, 7-5 
signal level, 13-3 


PBUSY signal | 
assertion duration, 13-2 
definition, 7-5 | 
disabling hardware handshakes, 13-2-13-3 
forcing active level, 13-2 
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signal level, 13-4 
PCO field (Program Counter 0), 16-9 
PC1 field (Program Counter 1), 16-9 
PC2 field (Program Counter 2), 16-10 
PDATA field (Parallel Port Data), 13-4 
PDR. See Paralle! Data Register (PDR) 
PER bit (Parity Error), 14-4 
PERADDR field (Peripheral Address), 11-4 


Peripheral Chip Selects, Regions 5-0 signals. See 
_ PIACS5—PIACSO signals 


Peripheral Interface Adapter (PIA) 
See also PIA 
initialization, 10-2 
overview, 10-1 
PIA accesses, 10-2-10-4 
extending a PIA read cycle with WAIT (diagram), 
10-5 
extending a PIA write cycle with WAIT (diagram), 
10-5 
extending I/O cycles, 10-3—10-4 
normal access timing, 10-2-10-3 
PIA read cycle (diagram), 10-3 
PIA write cycle (diagram), 10-4 
programmable registers, 10-1—10-2 
Signals 7 
PIACS5—PIACSO, 7-4 
PIAOE, 7-4 
PIAWE, 7-4 


Peripheral Output Enable signal. See PIAOE signal 


peripheral registers 
See also registers 
address assignments, 7-8—7-10 
field summary (table), C-6—C-11 
register summary, C-1—C-10 


Peripheral Write Enable signal. See PIAWE oor 
PGO (Page-Mode DRAM, Bank 0), 9-2 

PG1 (Page-Mode DRAM, Bank 1), 9-2 

PG2 (Page-Mode DRAM, Bank 2), 9-2 

PG3 (Page-Mode DRAM, Bank 3), 9-2 

PHYBASE field (Physical Base Address), 9-4 

PIA Control Register 0, description, 10-1—10-2 
PIA Control Register 1, description, 10-1-10-2 


PIACS5-—PIACSO signals 
definition, 7-4 
external DMA transfers, 11-3, 11-6 
PIAOE signal 
definition, 7-4 
extending PIA accesses, 10-2 
external DMA transfers, 11-9—11-11 
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PIAWE signal 
definition, 7-4 

_ extending PIA accesses, 10-2 
external DMA transfers, 11-9—11-11 


pin changes, Am29205 microcontroller, 7-7 
PIN field (PIO Input), 12-2 
PIO Control Register, description, 12-1-12-2 
PIO Input Register, description, 12-2 
PIO Output Enable Register, description, 12-3 
PIO Output Register, description, 12-2—12-3 
PIO15—PIO0 signals, definition, 7-5 
Pipeline Hold mode, multiple data accesses, 3-10 
pipelining 
delayed branch, 5-2-5-4 
delayed effects of registers, 5-5 
four-stage instruction execution, 5-1 
overlapped loads and stores, 5-4—5-5 
overview, 5-1 © 


Pipeline Hold mode, 5-1—5-2 
serialization, 5-2 | 


PMODE field (Parity Mode), 14-2 
POE signal, definition, 7-5 
POEN field (PIO Output Enable), 12-3 


pointers 
See also indirect pointers 
frame pointer (fp), 4-5 
register allocate bound pointer (rab), 4-5 
register free bound pointer (rfb), 4-5 
register stack pointer (rsp), 4-5 


POUT field (PIO Output), 12-3 
PPI bit (Parallel Port Interrupt), 16-24 
PRL field (Processor Release Level), 2-27 


procedure linkage 
argument passing, 4-7-4-8 
complex procedure call example, 4-14 
conventions, 4-6—4-13 
fill handlers, 4-11 
local variables and memory-stack frames, 
4-11-4-12 
Memory Stack, 4-5—-4-6 
overview, 4-1 
procedure epilogue, 4-10—4- 11 
procedure prologue 
_ allocation of memory-stack frames, 4-11-4-12 
definition, 4-8 
frame allocation on Register Stack, 4-8—4-9 
Register Stack, 4-3 
register stack leaf frame, 4-11 
register usage convention, 4-13-4-14 
return values, 4-10 
run-time stack, 4-1—4-6 
spill handler, 4-10 


Static link pointer, 4-12-4-13 
trace-back tags, 4-15-4-17 
transparent procedures, 4-13 


processor initialization, 2-26—2-28 
See also initialization 
Configuration Register, 2-26-2-27 
Current Processor Status Register, 2-27 
overview, 2-26 
Reset mode, 2-27-2-28 


processor registers 
field summary (table), B-7-B- 9 
register summary, B-1—B-9 


processor signals 
A23—A0, 7-1 
ID31—IDO, 7-1 
INTR3-INTRO, 7-2 
RW, 7-2 
RESET, 7-2 
STAT2-STATO, 7-2 
TRAP1—TRAPO, 7-3 
WAIT, 7-1 
WAIT/TRIST, 7-2 
WARN, 7-2 © 


product support 
bulletin board service, iii 
documentation and literature, iii, xix—xx 
technical support hotline, iii 


Program Counter 0 Register, description, 16- 9 
Program Counter 1 Register, description, 16-9 
Program Counter 2 Register, description, 16-10 


Programmable I/O Port ay 

See also PIO 

initialization, 12-3 

operating the I/O port, 12-3 

overview, 12-1 

programmable registers 
PIO Control Register, 12-1-12-2 
PIO Input Register, 12-2 
PIO Output Enable Register, 12-3 
PIO Output Register, 12-2-12-3 

signals, PlIO15—PIO0, 7-5 


Programmable Input/Output signals. See PIO15—PIOO 
signals 


programming 
activation records, 4-1—4-6 
ALU Status Register, 2-16—2-17 
arithmetic operation status results, 2-17 
branch instructions, 2-7 
compare instructions, 2-1 
complementing a Boolean, 2-25 
Configuration Register, 2-26-2-27 
constant instructions, 2-5 
data movement instructions, 2-4 
division, 2-22-—2-24 
Floating-Point Environment Register, 2-14—2-15 
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floating-point instructions, 2-6 
Floating-Point Status Register, 2-18—2-20 
floating-point status results, 2-18 
general-purpose registers, 2-8—2-11 
global registers, 2-10 

Indirect Pointer A Register, 2-13 
Indirect Pointer B Register, 2-14 
Indirect Pointer C Register, 2-13 
indirect register addressing, 2-12—2-14 
instruction environment, 2-14—2-16 
instruction scheduling, 5-1—5-6 
instruction set, 2-1-2-8 

integer arithmetic instructions, 2-1 
Integer Environment Register, 2-15—2-16 
integer multiplication and division, 2-19-2-24 
large jump and call ranges, 2-25 

local registers, 2-10—2-11 

local-register stack pointer, 2-11-2-28 
logical instructions, 2-4 

logical operation status results, 2-17-2-18 
miscellaneous instructions, 2-7 
multiplication, 2-20-2-22 

multiprecision integer operations, 2-25 
NO-OPs, 2-25 | 

operating-system calls, 2-24—2-25 
pipelining, 5-1—5-6 

procedure linkage, 4-1-4-17 » 
processor initialization, 2-26—2-28 

Q Register, 2-20 

register addressing, 2-10 

register model, 2-8—-2-12 

register usage convention, 4-13—4-14 
reserved instructions, 2-8 

reset mode, 2-27-2-28 

run-time checking, 2-24 

run-time stack organization, 4-1—4-6 
shift instructions, 2-4 

special-purpose registers, 2-11—2-13 
status results of instructions, 2-16—2-19 
trapping arithmetic instructions, 2-26 
virtual arithmetic processor, 2-26 

virtual registers, 2-26 


protection of registers. See system protection 


Protection Violation trap 


assert instructions, 2-1 

protected special-purpose registers, 2-12. 
Supervisor mode, 6-1 . 

User mode, 6-1 

virtual registers, 2-26 


PSI bit (Page Sync Invert), 15-2 
PSIO bit (Page Sync Input/Output), 15-2 


PSL bit (Page Sync Level), 15-2 


PSTROBE signal 
definition, 7-5 
signal level, 13-3 
timing hardware handshakes, 13-3 
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PSYNC signal 
definition, 7-6 
signal level, 15-2 


PWE signal, definition, 7-5 


Q 


-Q field (Quotient/Multiplier), 2-20. 
Q Register, description, 2-20 
QEN bit (Queue Enable), 11-3 


R 


RW signal, definition, 7-2 

rab. See register allocate bound (rab) 
RAS3-RASO signals, definition, 7-3 
RDATA field (Receive Data), 14-5 
RDR bit (Receive Data Ready), 14-3 
Read/Write signal. See R/W signal 
Receive Data signal. See RXD signal 
REFRATE field (Refresh Rate), 9-2 


register allocate bound pointer (rab) 
definition, 4-5 
register conventions, 4-14 
Register Bank Protect Register 
description, 6-2-6-3 
protecting general-purpose registers, 6-1-6-3 


register free bound pointer (rfb) 
definition, 4-5 
register conventions, 4-14 


register number, 2-10 


Register Stack 
activation record, 4-3 
description, 4-3 
local registers as a stack cache, 4-4-4-5 
local registers for caching, 4-4—4-5 
local variables and memory-stack frames, 
4-11-4-12 


procedure prologue for frame allocation, 4-8-4-9 


telationship to stack cache (figure), 4-4 
storage allocation, 4-2 


register stack pointer (rsp). 
definition, 4-5 . 
register conventions, 4-13 


register summary, special-purpose registers, B-3—B-6, 


C-1-C-5 


registers 
addressing, 2-10 
addressing indirectly, 2-12—2-14 
ALU Status (ALU, Register 132), 2-16—2-17 
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bank organization, B-2 

Baud Rate Divisor (BAUD, adress 80000090), 
14-5 

Byte Pointer (BP, Register 133), 3-2-3-3 

Channel Address (CHA, Register 4), 16-18 

Channel Control (CHC, Register 6), 16-18-16-19 

Channel Data (CHD, Register 5), 16-18 | 

Configuration (CFG, Register 3), 2-26-2-27 

Current Processor Status (CPS, Register 2), 
16-1-16-3 


. delayed effects, 5-5 


DMAO Address (DMADO, Address 80000034), 11-4 
DMAO Address Tail (TADO, Address 80000070), 
11-4 


- DMAO Control (DMCTO, Address 80000030), 


11-1-11-4 
DMAO Count (DMCNO, Address 80000038), 11-5 
DMAO Count Tail (TCNO, Address 8000003C), 11-5 
DMA1 Address (DMAD1, Address 80000044), 11-7 
DMA1 Control (DMCT1, Address 80000040), 
11-5-11-7 
DMA1 Count Register (DMCN1, Address 
80000048), 11-7 


DRAM Configuration (DRCF, Address 8000000C), 


9-2-9-3 

DRAM Contro! (ORCT, Address 80000008), 9-1-9-2 

DRAM Mapping 0 (DRMO, Address 80000010), 
9-3-9-4 

DRAM Mapping 1 (DRM1, Address 80000014), 9-4 

DRAM Mapping 2 (DRM2, Address 80000018), 9-4 

DRAM Mapping 3 (DRM3, Address 8000001C), 9-4 

Floating-Point Environment (FPE, Register 160), 
2-14—2-15 

Floating-Point Status (FPS, Register 162), 
2-18-2-19 

Funnel! Shift Count (FC, Register 134), 3-3-3-4 

general-purpose, 2-8-2-11 

global, 2-10 

Indirect Pointer A (IPA, Register 129), 2-13 

Indirect Pointer B (IPB, Register 130), 2-14 

Indirect Pointer C (IPC, Register 128), 2-13 

Integer Environment (INTE, Register 161), 
2-15-2-16 | 

Interrupt Control (ICT, Address 80000028), 
16-23-16-24 

Load/Store Count Remaining (CR, Register 136), 
3-11 

local, 2-10—2-11 

Old Processor Status (OPS, Register 1), 16-6 

Parallel Port Control (PPCT, Address 800000C0), 
13-1-13-3 

Parallel Port Data (PPDT, Address 800000C4), 13-4 

Parallel Port Status (PPST, Address 800000C8), 
13-3-13-4 _ 

peripheral register address assignments, 7-8-—7-10 

peripheral register summary, C-1-C-10 

PIA Control 0 (PICTO, Address 80000020), 
10-1-10-2 

PIA Control 1 (PICT1, Address 80000024), 
10-1-10-2 


PIO Control (POCT, Address 800000D0), 12-1—12-2 

PIO Input (PIN, Address 800000D4), 12-2 

PIO Output (POUT, Address 800000D8), 12-2—12-3 

PIO Output Enable (POEN, Address 800000DC), 
12-3 

processor register summary, B-1—B-9 

Program Counter 0 (PCO, Register 10), 16-9 

Program Counter 1 (PC1, Register 11), 16-9 

Program Counter 2 (PC2, Register 12), 16-10 

protection, 6-1-6-3 

Q (Q, Register 131), 2-20 

Register Bank Protect (RBP, Register 7), 6-2—6-3 

register usage conventions, 4-13—4-14 

reserved fields, A-1 

ROM Configuration (RMCF, Address 80000004), 
8-2--8-3 

ROM Control (RMCT, Address 80000000), 8-1—8-2 

Serial Port Control (SPCT, Address 80000080), 
14-1-14-3 

Serial Port Receive Buffer (SPRB, Address 
8000008C), 14-4—14-5 

Serial Port Status (SPST, Address 80000084), 
14-3-14-4 

Serial Port Transmit Holding Soa Address 
80000088), 14-4 

Side Margin (SIDE, Address 800000E8), 15-3 

special-purpose, 2-11—2-13, B-3-—B-6 

Timer Counter (TMC, Register 8), 16-22-16-23 

Timer Reload (TMR, Register 9), 16-23 

Top Margin (TOP, Address .800000E4), 15-3 

Vector Area Base Address (VAB, Register 0), 16-5 

Video Control (VCT, Address 800000E0), 15-1-15-3 

Video Data Holding (VDT, Address 800000EC), 
15-3-15-4 

virtual, 2-26 


reserved instructions, table, 2-8 
Reset mode, 2-27—2-29 


RESEF signal 
definition, 7-2 
invoking Reset mode, 2-27-2-28 


Reset signal. See RESET signal 
rfb. See register free bound pointer (rfb) 
RM bit (Floating-Point Invalid Operand Mask), 2-15 


RMODE field (Receive Mode) 
Serial Port Control Register, 14-3 
serial port initialization, 14-5 


ROM accesses 
burst-mode accesses, 8-8 
byte writes, 8-7-8-8 
extending ROM cycles, 8-8 
narrow ROM accesses, 8-4—8-6 
ROM address mapping, 8-4 
simple ROM accesses, 8-4 
simple writes, 8-7 
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ROM Chip Selects, Banks 3-0 signals. See 
ROMCS3—-ROMCSO signals 


ROM Configuration Register, description, 8-2-8-3 
ROM Control Register, description, 8-1—8-2 








ROM controller 


See also ROM accesses 
address mapping, 8-4 
initialization, 8-3-8-4 
overview, 8-1 
programmable registers 
ROM Configuration Register, 8- 2-8-3 
ROM Control Register, 8-1—-8-2 
signals 
BOOTW, 7-3 
BURST, 7-3. 
ROMCS3-—ROMCSO, 7-3 
ROMOE, 7-3 
RSWE, 7-3 


ROM Output Enable signal. See ROMOE Sonia 
ROMCS3-ROMCSO signals, definition, 7-3 
ROMOE signal, definition, 7-3 

round mode, 2-14 




















_ Row Address Strobe, Banks 3-0 signals. See 


RAS3—RASO signals 
RS bit (Floating-Point Reserved Operand Sticky), 
2-19 


RSIE bit (Receive Status Interrupt Enable), Serial Port 
Control Register, 14-2 


rsize value, 4-8 

rsp. See register stack pointer (rsp) 

RSWE signal, definition, 7-3 | 

RT bit (Floating-Point Reserved Operand Trap), 2-19 
run-time checking, 2-24 


run-time organization, register usage conventions, 
4-13—-4-14 
run-time stack 
activation records, 4-1 
allocation of storage locations, 4-2 
definition, 4-1—-4-6 
local registers as a stack cache, 4-4—4-5 
management, 4-1—4-2 | 
memory stack, 4-5-4-7. 
Register Stack, 4-3 . 
stack cache, 4-4—4-6 
stack overflow, 4-6 


RW bit (Read/Write), 11-3, 11-7 

RXD signal, definition, 7-6 

RXDI bit (Serial Port Receive Data Interrupt), 16-24 
RXSI bit (Serial Port Receive Status Interrupt), 16-24 
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SAMPLE instruction, 17-7 

SB bit (Set Byte Pointer/Sign Bit), 3-8 
SC bit (Static-Column DRAM), 9-2 
SDIR bit (Shift Direction), 15-2 


serial port 
clock specification, 14-5 
initialization, 14-5 
internal OMA transfers, 11-1, 11-8-11-12, 14-2, 14-3 
overview, 14-1 
programmable registers 
_ Baud Rate Divisor Register, 14-5 
Serial Port Control Register, 14-1-14-3 
Serial Port Receive Buffer Register, 14-4-14-5 
Serial Port Status Register, 14-3—14-4 
Serial Port Transmit Holding Register, 14-4 
signals 
DSR, 7-6 
DTR, 7-6 
RXD, 7-6 
TXD, 7-6 
UCLK, 7-6 


Serial Port Control Register, description, 14-1-14-3 


Serial Port Receive Buffer Register, description, 
14-4-14-5 


Serial Port Status Register, description, 14-3-14-4 


serial Port Transmit Holding Register, description, 
14-4 


serializer/deserializer. See video interface 


SETIP (Set Indirect Pointers) instruction, description, 
18-105 


shift instructions 
EXTRACT (Extract Word, Bit-Aligned), 18-64 
overview, 2-4 
SLL (Shift Left Logical), 18-106 
SRA (Shift Right Arithmetic), 18-108 
SRL (Shift Right Logical), 18-109 
table, 2-4 


Side Margin Register, description, 15-3 
signal description, 7-1—7-7 


Signals 
A23—A0, 7-1 
access priority, 7-7—7-8 
BOOTW, 7-3 
BURST, 7-3 
CAS3-CASO, 7-4 
DACK1—DACKO, 7-4-7-5 
DREQ1—DREQO, 7-4 
DSR, 7-6 
DTR, 7-6 
GACK, 7-5 
GREQ, 7-5 








1D31-!DO, 7-1 
INCLK, 7-1 
INTR3-INTRO, 7-2 
LSYNC, 7-6 
MEMCLK, 7-1 
PACK, 7-5 
PAUTOFD, 7-5 
PBUSY, 7-5 
PIACS3—PIACSO, 7-4 
PIAOE, 7-4 
PIAWE, 7-4 
PIO15—PIO0, 7-5 
POE, 7-5 
PSTROBE, 7-5 
PSYNC, 7-6 

PWE, 7-5 

RM, 7-2 
RAS3-RASO, 7-3 
RESET, 7-2 
ROMCS3—ROMCSO, 7-3 
ROMOE, 7-3 
RSWE, 7-3 

RXD, 7-6 
STAT2—STATO, 7-2 
TCK, 7-6 

TDI, 7-6 

TDMA, 7-5 

TDO, 7-7 

TMS, 7-6 

TR/OE, 7-4 
TRAP1—TRAPO, 7-3 
TRIST, 7-2 : 
TRST, 7-7 

TXD, 7-6 





























WAIT, 7-1 
_ WAIT/TRIST, 7-2 
WARN, 7-2 
WE, 7-4 
size value, 4-8 
SLL (Shift Left Logical) instruction, description, 18-106 
slp. See static link pointer (slp) 
SM bit (Supervisor Mode), 16-3 
special-purpose registers. See registers 


special-purpose registers 
organization, 2-12 
overview, 2-11-2-12 
spill handler, 4-10 


SQRT (Floating-Point Square Root) instruction, 
description, 18-107 


SRA (Shift Right Arithmetic) instruction, description, 


18-108 
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SRL (Shift Right Logical) instruction, description, 
18-109 


ST bit (Set), 16-19 
stack. See run-time stack 
stack cache 


definition, 4-2 
relationship to Register Stack (figure), 4-4 
stack overflow, 4-5 
Stack Pointer 
allocating activation records, 4-4 
definition, 2-11 
delayed effects of registers, 5-5 
protection, 2-25 
stack underflow, 4-5 
STAT2-STATO signals 
boundary-scan cells, 17-5 
definition, 7-2 
Halt mode, 17-11-17-12 
ICTEST1 scan path, 17-10 
ICTEST2 scan path, 17-10—17-11 
Load Test Instruction mode, 17-13-17-14 
processor status outputs, 17-2-17-3 | 
Step mode, 17-12-17-13 
Static link pointer (slp) 
description, 4-12-4-13 
register conventions, 4-13 
Static parent, 4-12—4-13 
STB bit (PSTROBE Level), 13-3 
Step mode, 17-12—17-13 
STORE (Store) instruction, description, 18-110 
Store instructions. See load and store instructions 


STOREL (Store and Lock) instruction, description, 
48-111 | 


STOREM (Store Multiple) instruction 
description, 18-112 
multiple data accesses, 3-9-3-11 


STP bit (Stop Bits), 14-2 
SUB (Subtract) instruction, description, 18-113 


SUBC (Subtract with Carry) instruction, description, 
18-114 


SUBCS (Subtract with Carry, Signed) instruction, 
description, 18-115 


SUBCU (Subtract with Carry, Unsigned) instruction, 
description, 18-116 


SUBR (Subtract Reverse) instruction, description, 
18-117 


SUBRC (Subtract Reverse with Carry) instruction, 
description, 18-118 
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SUBRCS (Subtract Reverse with Carry, Signed) 
instruction, description, 18-119 


SUBRCU (Subtract Reverse with Carry, Unsigned) 
instruction, description, 18-120 


SUBRS (Subtract Reverse, Signed) instruction, 
description, 18-121 


SUBRU (Subtract Reverse, Unsigned) instruction, 
description, 18-122 


SUBS (Subtract, Signed) instruction, description, 
18-123 


subtraction instructions 
DSUB (Floating-Point Subtract, Double-Precision), 
_ 18-59 | 

FSUB (Floating-Point Subtract, Single-Precision), 
18-72 

SUB (Subtract), 18-113 

SUBC (Subtract with Carry), 18-114 

SUBCS (Subtract with Carry, Signed), 18-115 

SUBCU (Subtract with Carry, Unsigned), 18-116 

SUBR (Subtract Reverse), 18-117 

SUBRC (Subtract Reverse with Carry), 18-118 

SUBRCS (Subtract Reverse with Carry, Signed), 

18-119 

SUBRCU (Subtract Reverse with Carry, Unsigned), 
18-120 

SUBRS (Subtract Reverse, Signed), 18-121 » 

SUBRU (Subtract Reverse, Unsigned), 18-122 

SUBS (Subtract, Signed), 18-123 

SUBU (Subtract, Unsigned), 18-124 


SUBU (Subtract, Unsigned) instruction, description, 
18-124 


Supervisor mode, overview, 6-1 
support. See product support 
system address partition, 7-8 


system overview 
access priority, 7-7—7-8 
internal peripheral address assignments, 7-8-7-10 
internal peripherals and controllers, 7-8—7-9 
pin changes, 7-7 
signal description, 7-1—7-7 
clocks, 7-1 
DMA controller, 7-4—7-5 
DRAM interface, 7-3-7-4 | 
I/O port, 7-5 : 
JTAG 1149.1 boundary scan interface, 7-6—7-7 
parallel port, 7-5 
Peripheral Interface Adapter (PIA), 7-4 
processor signals, 7-1—7-3 
ROM interface, 7-3 
serial port, 7-6 
video interface, 7-6 | 
system address partition, 7-8 


system protection 
general-purpose registers, 6-1—6-3 
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overview, 6-1 
special-purpose registers, 2-11 


T 


tav. See trap handler argument (tav) 
TCK signal | 

definition, 7-6 

required setting, A-1 


TCV field (Timer Count Value), 16-22—1 6-23 
TD bit (Timer Disable), 16-2 

TDATA field (Transmit Data), 14-4 

TDELAY field (Transfer Delay), 13-2 
TDELAYV field (TDELAY Counter Value), 13-3 


TDI signal 
definition, 7-6 
required setting, A-1 
TDMA signal 
definition, 7-5 
DMA transfer count, 11-5 
processor interrupt, 11-3, 11-7 
terminating external DMA transfers, 11-3, 11-7 


TDO signal, definition, 7-7 


TE bit (Trace Enable) 
control of tracing, 17-1~-17-2 
Current Processor Status Register, 16-2 


TEMT bit (Transmitter Empty), 14-3 
Terminate DMA signal. See TDMA signal 


Test Access Port, 17-4—17-11 
boundary-scan cells, 17-4—17-5 
BYPASS instruction, 17-8 
EXTEST instruction, 17-6 
ICTEST1 instruction, 17-7 
ICTEST2 instruction, 17-7 
. implemented instructions, 17-6—17-8 
instruction register, 17-6—17-8 
INTEST instruction, 17-6—17-7 
SAMPLE instruction, 17-7 
scan paths, 17-8-17-11 
Test Clock Input signal. See TCK signal 
Test Data Input signal. See TDI signal 
Test Data Output signal. See TDO signal 
Test Mode Select signal. See TMS signal 
Test Reset Input signal. See TRST signal 
THRE bit (Transmit Holding Register Empty), 14-3 
Three-State Control signal. See WAIT/TRIST signal 
Timer Counter Register, description, 16-22-—16-23 


Timer Facility 
disabling Timer interrupts, 16-2 


initialization, 16-21 

operation, 16-21 

overview, 16-21 

Timer Counter Register, 16-22-16-23 
Timer Reload Register, description, 16-23 
uses, 16-22 , 


Timer interrupt, 16-22 


Timer Reload Register, description, 16-23 


TMODE field (Transmit Mode) 
Serial Port Control Register, 14-2 
serial port initialization, 14-5 


TMS signal 
definition, 7-6 
required setting, A-1 


Top Margin Register, description, 15-3 
TOPCNT field (Top Margin Count), 15-3 


TP bit (Trace Pending) 
control of tracing, 17-1-17-2 
Current Processor Status Register, 16-2 


tpc. See trap handler return address (tpc) 
TR field (Target Register), 16-19 

TR/OE signal, definition, 7-4 

TRA bit (Transfer Active), 13-2 

Trace Facility, 17-1—17-2 

trace-back tags, 4-15—4-17 


_ Transmit Data signal. See TXD signal 


trap handler argument (tav) 
description, 4-10 
register conventions, 4-13 


trap handler return address (tpc) 
description, 4-10 
register conventions, 4-13 


Trap Requests 1-0 signals. See TRAP1—TRAPO 
signals | 

trap status bits, Floating-Point Exception trap, 
2-18-2-19 


TRAP1—TRAPO signals 
definition, 7-3 
traps, 16-4 


traps 
See also interrupts and traps; specific trap names 
EMULATE (Trap to Software Emulation Routine), 
18-60 | 
enabling and disabling, 16-4 
external traps, 16-4 
Floating-Point Exception trap, 16-20 
Illegal Opcode trap, 16-4, 17-2 
Out-of-Range trap, 16-16, 16-19-16-20 
priority table, 16-15 
Protection Violation trap, 6-1 
TRAP1—TRAPO signals, 16-4 
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trapping Arithmetic instructions, 2-26 
Unaligned Access trap, 3-13, 16-2 
WARN trap, 16-13—16-14 


TRST signal 
. definition, 7-7 
required setting, A-1 


TRV field (Timer Reload Value), 16-23 

TTE bit (TDMA Terminate Enable), 11-3, 11-7 

TTI bit (TDMA Terminate Interrupt), 11-3, 11-7 

TU bit (Trap Unaligned Access), 16-2 

TXD signal, definition, 7-6 | 
TXDI bit (Serial Port Transmit Data Interrupt), 16-24 


U 


UART Clock signal. See UCLK signal 


UCLK signal 
definition, 7-6 
required setting, A-1 
serial clock divisor, 14-5 


UD bit (Transfer Up/Down), 11-3, 11-6—11-7 
_ UM bit (Floating-Point Underflow Mask), 2-15 


Unaligned Access trap, 16-2 
OPT field values, 3-13 


underflow. See fill handlers 
Universal Debug Interface (UDI), 1-7 


UNIX common object file format (COFF), extensions, 
1-7 


US bit (Floating-Point Underflow Sticky), 2-19 
User mode, overview, 6-1—6-2 
UT bit (Floating-Point Underflow Trap), 2-19 


V 


V bit (Overflow) 
ALU Status Register, 2-16 
arithmetic operation status results, 2-17 


VAB field (Vector Area Base), 16-5 — 
VALID bit (Valid Mapping), 9-3 


VCLK signal 
definition, 7-6 
video clock divisor, 15-1 


VDAT signal, definition, 7-6 
VDATA field (Video Data), 15-4 
VDI bit (Video Interrupt), 16-24 


Vector Area Base Address Register, description, 16-5 


vector numbers 
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assignments (table), 16-7—16-9 
specifying, 2-24 


Video Clock signal. See VCLK signal 

Video Control Register, description, 15-1—15-3 
Video Data Holding Register, description, 15-3-15-4 
Video Data signal. See VDAT signal 


Video DRAM Transfer/Output Enable signal. See 
TR/OE signal 


video DRAM transfers, 9-12 


video interface — 
clock specification, 15-4—15-5 
initialization, 15-4—15-5 
internal DMA transfers, 11-1, 11-8—11-12, 15-2 
operation, 15-4—15-7 
overview, 15-1 
programmable registers 
Side Margin Register, 15-3 
Top Margin Register, 15-3 
Video Control Register, 15-1—15-3 
Video Data Holding Register, 15-3-15-4 
receiving data, 15-6—15-7 , 
signals 
LSYNC, 7-6 
PSYNC, 7-6 
VCLK, 7-6 
VDAT, 7-6 
transmitting data, 15-5—15-6 


VIDI bit (Video Invert), 15-3 


_ VIRTBASE field (Virtual Base Address), 9-3 


VM bit (Floating-Point Overflow Mask), 2-15 
VS bit (Floating-Point Overflow Sticky), 2-19 
VT bit (Floating-Point Overflow Trap), 2-19 


W 


Wait mode, 16-4—16-5 


WAIT signal 
definition, 7-1 
extending PIA 1/O cycles, 10-3-10-4 
figures, 10-5 
extending ROM cycles, 8-8 
figures, 8-11 — ) 
external DMA transfers, 11-9-11-11 
WAIT/TRIST signal 
See also WAIT signal . 
definition, 7-2 
required setting, A-1 


WARN signal 


definition, 7-2 
description, 16-14 


WARN trap, 16-13-16-14 
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WE signal, definition, 7-4 | 
WLGN field (Word Length), 14-2 
WM bit (Wait Mode), 16-3 

Write Enable signal. See WE signal 
WSO field (Wait States, Bank 0), 8-2 
WS1 field (Wait States, Bank 1), 8-2 
WS2 field (Wait States, Bank 2), 8-2 
WSS field (Wait States, Bank 3), 8-2 


X 


XM bit (Floating-Point Inexact Result Mask), 2-15 


XNOR (Exclusive-NOR Logical) instruction, 
description, 18-125 


XOR (Exclusive-OR Logical) instruction, description, 
18-126 


XS bit (Floating-Point Inexact Result Sticky), 2-19 
XT bit (Floating-Point Inexact Result Trap), 2-19 


Z 


Z bit (Zero) 
ALU Status Register, 2-16 
arithmetic operation status results, 2-17 
logical operation status results, 2-17~2-18 
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